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ABSTRACT 



The present invention provides p53 proteins with altered 
tetramerization domains that retain wild-type p53 function, 
and the ability to form tetramers and have at least one of the 
following characteristics: (1) do not hetero-oligomerize with 
wild-type p53 or tumor-derived p53 mutants, and (2) 
restricted DNA binding specificity from an alteration in the 
way that the tetramerization domain orients the DNA bind- 
ing domains of a p53 tetramer relative to one another. The 
invention also provides nucleic acids encoding the above 
proteins and methods of enhancing the cellular response to 
DNA damaging agents, treating diseases characterized by 
abnormal cell proliferation, and inducing immune tolerance 
to facilitate transplants and treatment of autoimmune dis- 
ease, by administration of proteins of the invention or 
nucleic acid sequences enc ding the proteins of the inven- 
ti n. 

15 Claims, 4 Drawing Sheets 
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P53 PROTEINS WITH ALTERED 
TETRAMERIZATION DOMAINS 

I. FIELD OF THE INVENTION 

The present invention relates to the field of p53 proteins 
with altered oligomerization domains, polynucleotide 
sequences encoding them, and their use in therapy. 

H. BACKGROUND OF THE INVENTION 

Wild-type (wt) p53 is a sequence-specific DNA binding 
protein found in humans and other rnammals, which has 
tumor suppressor function [See, e.g., Harris (1993), Science, 
262:1980-1981]. The wild-type p53 protein functions to 
regulate cell proliferation and cell death (also known as 
apoptosis). It also participates in the response of the cell to 
DNA damaging agents [Harris (1993), cited above]. In more 
than half of all human tumors p53 is inactivated by muta- 
tions and is therefore unable to arrest cell proliferation or 
induce apoptosis in response to DNA damaging agents, such 
as radiation and chemotherapeutics commonly used for 
cancer treatment The nucleotide and amino acid sequences 
of human p53 are reported below as SEQ ID NOS: 1 and 2, 
respectively [Zakut-Houri et al, (1985), EMBO J., 
4:1251-1255; GenBank Code Hsp53], Hie amino acid 
sequence of p53 is conserved across evolution [Soussi et al, 
(1990), Oncogene, 5:945-952], suggesting that its function 
is also conserved. 

At the biochemical level, p53 is a tetrameric DNA 
sequence-specific transcription factor. Its DNA binding and 
transcriptional activities are required for p53 to suppress 
tumor growth [Pietenpol et al, (1994), Proc. Natl. Acad. Sci. 
USA, 91:1998-2002]. p53 forms homotetramers in the 
absence of DNA and maintains its tetrameric stoichiometry 
when bound to DNA [Kraiss et al, (1988), J. Virol., 
62:4737-4744; Stenger et al, (1992), Mol. Carcinog., 
5:102-106; Sturzbecher et al, (1992), Oncogene, 
7:1513-1523; Friedman et al, (1993), Proc. Natl. Acad. Sci. 
USA, 90:3319-3323; Halazonetis and Kandil (1993), 
EMBO J., 12:5057-5064; and Hainaut et al, (1994), Onco- 
gene, 9:299-303]. Consistent with the observation that p53 
binds DNA as a homotetramer, the known physiologically 
relevant DNA sites recognized by p53 contain four penta- 
nucleotide repeats [El-DeiryDeiry et al, (1993), Cell, 
75:817-825; Wu et al, (1993), Genes Dev., 7:1126-1132; 
Kastan et al, (1992), Cell, 71:587-597]. Each pentanucle- 
otide repeat is recognized by one subunit of the p53 
homotetramer [Halazonetis and Kandil (1993), cited above; 
Cho et al, (1994), Science, 265:346-355]. The ability of p53 
to bind DNA in a sequence-specific manner maps to amino 
acid residues 90-290 of SEQ ID NO: 2 [Halazonetis and 
Kandil (1993), cited above; Pavletich et al, (1993), Genes 
Dev., 7:2556-2564; Wang et al, (1993), Genes Dev., 
7:2575-2586]. 

Once bound to DNA, p53 activates gene transcription 
from neighboring promoters. The ability of p53 to activate 
gene transcription has been mapped to within amino acid 
residues 1-90 of SEQ ID NO: 2 [Fields et al, (1990), 
Science, 249:1046-1049]. 

The C-terminus of the human p53 turn r suppressor 
protein (i.e., amino acids 290-393 of human p53, SEQ ID 
NO: 2) has two functions. It induces p53 oligomerization 
and it regulates p53 DNA binding by controlling the con- 
formation of p53 tetramers. These two functi ns map to 
independent regions. Oligomerization maps to amino acid 
residues 322-355 of SEQ ID NO: 2 [Wang et al, (1994), 
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Mot. Cell. Biol, 14:5182-5191; Clore et al, (1994), Science, 
265:386-391]. Regulation of DNA binding maps to amin 
acid residues 364-393 of human p53 [SEQ ID NO: 2] or t 
the corresponding region encompassing residues 361-390 of 
mouse p53 [SEQ ID NO: 15] [Hupp et al, (1992), Cell, 
71:875-886; Halazonetis et al, (1993), EMBO J., 
12:1021-1028; Halazonetis and Kandil (1993), cited above; 
Genbank locus Mmp53r]. 

Mutations of the p53 protein in most human tumors 
involve the sequence-specific DNA binding domain, so that 
the mutant proteins are unable to bind DNA [Bargonetti et 
al, (1992), Genes Dev., 6:1886-1898]. The loss of p53 
function is critical for tumor development. Introduction of 
wild-type p53 into tumor cells leads to arrest of cell prolif- 
eration or cell death [Finlay et al, (1989), Cell, 
57:1083-1093; Eliyahu et al, (1989), Proc. Nad. Acad. Sci. 
USA, 86:8763-8767; Baker et al, (1990), Science, 
249:912-915; Mercer et al, (1990), Proc. Natl. Acad. Sci. 
USA, 87:6166-6170; Diller et al, (1990), Mol. Cell. Biol., 
10:5772-5781; Isaacs et al, (1991), Cancer Res., 
51:4716-4720; Yonish-Rouach et al, (1993), Mol. CeU. 
Biol., 13:1415-1423; Lowe et al, (1993), Cell, 74:957-967; 
Fujiwaraetal, (1993), Cancer Res., 53:4129-4133; Fujiwara 
et al, (1994), Cancer Res., 54:2287-2291]. Thus, introduc- 
tion of wild-type p53 into tumor cells has been proposed to 
be a viable approach to treat human cancer [see, e.g., 
International Patent Applications WO 9406910 A, WO 
9416716 A, WO 9322430 Al, EP 390323, and EP 475623 
Al]. 

However, most tumors express mutant versions of p53 at 
high levels [Harris (1993), cited above]. Because these p53 
mutants have intact oligomerization domains, they form 
hetero- tetramers with wild-type p53. Such hetero-tetramers 
are biochemically inactive or characterized by considerably 
reduced activity compared to wild-type p53 tetramers [Mil- 
ner and Medcalf (1991), Cell, 65:765-774; Bargonetti et al, 
(1992), cited above; Farmer et al, (1992), Nature, 
358:83-86; Kem et al, (1992), Science, 256:827-830]. 
Thus, if one were to treat human cancer by introduction of 
wild-type p53 in tumor cells, the effectiveness of this 
therapeutic approach would be limited by the presence of 
mutant p53 in the cancer cells. 

Thus, there is a need in the art for the identification of 
compositions which are not inhibited by endogenous p53, as 
well as for methods for the uses of such compositions for 
therapeutic purposes. 

ffl. SUMMARY OF THE INVENTION 

The present invention provides novel modified p53 pro- 
teins, including preferably chimeric proteins formed by the 
association of sequences of p53 and sequences of other 
selected proteins, which novel proteins have desirable func- 
tional characteristics. 

In one aspect, the present invention provides p53 proteins 
with altered tetramerization domains characterized by the 
ability to form tetramers, bind DNA and activate transcrip- 
tion indistinguishably from wild-type p53, but incapable of 
forming hetero-tetramers with p53 proteins that have an 
intact tetramerization domain, such as wild-type p53 or 
tumor-derived p53 mutants. These p53 proteins of the inven- 
tion are preferably chimeric proteins, characterized by dis- 
ruption of the native p53 tetramerization domain and inser- 
tion of a heterologous oligomerization domain in a way that 
preserves tetramerization. 

In another aspect, the invention provides p53 proteins 
characterized by restricted DNA binding specificity from an 
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alteration in the way the tetramerization domain orients the 
DNA binding domains of a p53 tetramer relative to one 
another. These proteins are characterized by deletion of all 
or a significant portion of, or disruption of, the region 
between the DNA binding domain (amino acid residues 
90-289 of human p53 ofSEQ ID NO: 2) and tetramerization 
domain (amino acid residues 322-355 of human p53 of SEQ 
ID NO: 2). This region (spanning residues 290-321 of 
human p53 of SEQ ID NO: 2) is considered an extension of 
the p53 tetramerization domain. 

In still another aspect, the invention provides p53 proteins 
with both of the characteristics described above, namely: (1) 
ability to form tetramers, but inability to hetero-tetramerize 
with p53 proteins having an intact tetramerization domain, 
such as wild-type p53 or tumor-derived p53 mutants; (2) 
restricted DNA binding specificity from an alteration in the 
way that the tetramerization domain orients the DNA bind- 
ing domains of a p53 tetramer relative to one another. 

In still another aspect, the invention provides further 
modifications of the p53 proteins provided above. These 
modifications include: (1) altered transcription activation 
sequences (amino acid residues 1-90 of human p53 of SEQ 
ID NO: 2); (2) insertion of one or more nuclear localization 
signals; and (3) replacement of selected regions of the p53 
proteins with homologous regions of non-human p53 pro- 
tein, as well as conventional modifications such as insertion 
or deletion or substitution of individual amino acid residues 
throughout the sequence, and the use of linkers between 
portions of the chimeric proteins. 

Still another aspect of this invention provides p53 proteins 
having two or more of the above-described modifications. 

In yet another aspect, the present invention provides a 
nucleic acid sequence encoding a protein of the invention. 
These nucleic acids may be inserted into an appropriate 
vector for delivery to patients for gene therapy. Alternatively 
the nucleic acids may be inserted into a vector for in vitro 
expression of a protein of the invention, which is then 
introduced into patients. 

In a further aspect, the invention provides a method of 40 
treating an individual having a condition characterized by 
abnormal cell proliferation by delivering a protein or, pref- 
erably, a nucleic acid sequence, of the invention to the 
patient. 

Other aspects and advantages of the present invention are 
described further in the following detailed description of the 
preferred embodiments thereof. 



IV BRIEF DESCRIPTION OF THE DRAWINGS 50 

FIG. 1A schematically illustrates wild-type p53. The 
amino acid numbering, which is also maintained throughout 
FIGS. 1A-1K, refers to the residues of human p53 as 
indicated in SEQ ID NO: 2. The entire length of human p53 
is 393 amino acids. Symbols for the DNA binding domain 
(residues 90-290 of SEQ ID NO: 2) (checkerboard bar) and 
the oligomerization domain (residues 322-355 of SEQ ID 
NO: 2) (solid bar) are maintained throughout FIGS. 1A-1K. 

FIG. IB schematically illustrates a heterologous dimer- 
ization domain p53 chimeric protein containing residues 
1-346 of p53 [SEQ ID NO: 2], a glutamic acid for cloning 
convenience and a GCN4 dimerization domain correspond- 
ing to residues 253-281 of GCN4 of SEQ ID NO: 4 (hatched 
bar). 

FIG. 1C schematically illustrates a heterologous dimer- 
ization domain p53 chimeric protein containing residues 
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1-347 of p53 [SEQ ID NO: 2] and residues 253-281 f 
GCN4 of SEQ ID NO: 4 (hatched bar). 

FIG. ID schematically illustrates a heterologous dimer- 
ization domain chimeric p53 protein containing an insertion 
of a glutamic acid and residues 253-281 of GCN4 f SEQ 
ID NO: 4 (hatched bar) between residues 346 and 347 of p53 
[SEQ ID NO: 2]. 

FIG. IE schematically illustrates a heterologous dimer- 
ization domain chimeric p53 protein containing residues 
1-356 of human p53 [SEQ ID NO: 2] with a mutation within 
the native p53 oligomerization domain (leucine 344 to 
alanine) linked to the dipeptide glycine-asparagine and 
residues 253-281 of GCN4 of SEQ ID NO: 4 (hatched bar). 

FIG. IF schematically illustrates a heterologous tetramer- 
ization domain chimeric p53 protein which contains residues 
1-334 of human p53 [SEQ ID NO: 2] linked to an aspar- 
agine and then to a tetrameric variant of GCN4 residues 
249-281 of SEQ ID NO: " (hatched bar). 

FIG. 1G schematically ^ustrates a heterologous tetramer- 
ization domain chimeric p53 protein which contains residues 
1-325 of human p53 [SEQ ID NO: 2] linked to the tripeptide 
arginine-glycine-asparagine [SEQ ID NO: 7] and then to a 
tetrameric variant of GCN4 residues 249-281 of SEQ ID 
NO: 6 (hatched bar). 

FIG. 1H schematically illustrates a heterologous tetramer- 
ization domain chimeric p53 protein which contains residues 
1-323 of human p53 [SEQ ID NO: 2] linked to the hexapep- 
tide argim^e-glycme-glydne-asparagine-proline-glutamic 
acid [SEQ ID NO: 8] and then to a tetrameric variant of 
GCN4 residues 250-281 of SEQ ID NO: 6 (hatched bar). 

FIG. 1J schematically illustrates a heterologous tetramer- 
ization domain chimeric p53 protein which contains residues 
1-300 of human p53 [SEQ ID NO: 2] linked to the pen- 
tapeptide glycine-glycine-asparagine-glutamine-alanine 
[SEQ ID NO: 9] and then to a tetrameric variant of GCN4 
residues 250-281 of SEQ ID NO: 6 (hatched bar). 

FIG. IK schematically illustrates a heterologous tetramer- 
ization domain chimeric p53 protein which contains residues 
1-325 of human p53 [SEQ ID NO: 2] linked to the tripeptide 
arginine-glycine-asparagine [SEQ ID NO: 7], a tetrameric 
variant of GCN4 residues 249-28 1 of SEQ ID NO: 6 and an 
isoleucine (hatched bar), and then followed by residues 
352-393 of human p53 [SEQ ID NO: 2]. 

FIG.2A schematically illustrates the chimeric p53 protein 
of FIG. IF, which serves as a paradigm to indicate the 
various modifications that can be introduced into any of the 
p53 proteins of this invention (FIGS. 2B-2F). Symbols for 
the p53 DNA binding domain (checkerboard bar), the trun- 
cated p53 oligomerization domain (solid bar) and the het- 
erologous tetramerization domain (hatched bar) are main- 
tained in FIGS. 2B-2F. Also, the numbering throughout 
FIGS. 2A-2F refers to the residues of human p53 as 
indicated in SEQ ID NO: 2. 

FIG. 2B schematically illustrates a deletion of residues 
300-327 of human p53 [SEQ ID NO: 2], that confers novel 
DNA binding specificities. 

FIG. 2C schematically illustrates the substitution of the 
transcription activation domain of p53 with that of the 
herpes simplex virus protein VP16 (reverse hatched bar), 
also known as a trans-inducing factor. 

FIG. 2D schematically illustrates the insertion of a 
nuclear localization signal (NLS) between amino acid resi- 
dues 80 and 81 of p53 (SEQ ID NO: 2] (horizontal lined 
bar). The abbreviation a.a. represents amino acids. 

FIG. 2E schematically illustrates the substitution of 
human p53 residues 3-80 of SEQ ID NO: 2 with the 
corresponding xenopus sequences (cross-hatched bar). 
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FIG. 2F schematically illustrates two mutations that 
enhance function of the p53 proteins [SEQ ID NO: 2] of this 
invention, such as substitution of Arg 174 with Gin, or Arg 
175 with Leu. 

FIG. 3A schematically illustrates a wild-type p53 tetramer 
bound to a DNA site containing four contiguous pentanucle- 
otides repeats. For FIGS. 3A-3H, the p53 DNA binding 
domains are shown as circles, the oligomerization domain as 
a thin rectangle, the linker between the DNA binding and 
oligomerization domains as curved lines, the DNA as a thick 
rectangle and the specific pentnucleotides as arrows. 

FIG. 3B schematically illustrates a wild-type p53 tetramer 
bound to a DNA site containing two pentanucleotide pairs 
separated by a 20-30 nucleotide insert 

FIGS. 3C and 3D schematically illustrate a p53 tetramer 
with antiparallel alignment of its oligomerization domains 
and a short linker between the DNA binding and oligomer- 
ization domains. Such a p53 tetramer cannot bind to a DNA 
site containing four contiguous pentnucleotides repeats 
(FIG. 3C) t but can bind to a DNA site containing two 
pentanucleotide pairs separated by a 20-30 nucleotide insert 
(FIG. 3D). 

FIG. 3E schematically illustrates a chimeric p53 tetramer 
with parallel alignment of its oligomerization domains 
bound to a DNA site containing four contiguous pentanucle- 
otides repeats. 

FIG. 3F schematically illustrates a chimeric p53 tetramer 
with parallel alignment of its oligomerization domains 
bound to a DNA site containing two pentanucleotide pairs 
separated by a 20-30 nucleotide insert. 

FIGS. 3G and 3H schematically illustrate a chimeric p53 
tetramer with parallel alignment of its oligomerization 
domains and a short linker between the DNA binding and 
olig merization domains. Such a p53 tetramer can bind to a 
DNA site containing four contiguous pentnucleotides 
repeats (FIG. 3G), but not to a DNA site containing two 
pentanucleotide pairs separated by a 20-30 nucleotide insert 
(FIG. 3H). 
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V. DETAILED DESCRIPTION OF THE 
INVENTION 

The present invention provides p53 proteins with modi- 
fications in the native p53 sequence. These modifications, 
which do not interfere with its native tumor-suppressor 
function, provide the protein with at least one of die fol- 
lowing functional characteristics: (1) the ability to bind 
DNA and activate transcription like wild-type p53, but to not 
hetero-oligomerize with wild-type p53 or tumor-derived p53 
mutants; and (2) restricted DNA binding specificity from an 
alteration in the way that the tetramerization domain orients 
the DNA binding domains of a p53 tetramer relative to one 
another. In addition nucleic acids encoding such proteins 
and methods of using such proteins or nucleic acid 
sequences therapeutically are provided. All references to 
human p53 residue numbers refer to the numbering scheme 
provided by Zakut-Houri et al, (1985) [cited above). The 
nucleotide and amino acid sequences of human p53 are 
reproduced as SEQ ID NOS: 1 and 2, respectively. 
A. p53 Proteins with Altered Tetramerization Domains 
Definitions: A di merization domain is defined as a domain 
that allows formation f dimers, while a tetramerization 
domain is defined as a domain that allows formation of 65 
tetramers. An oligomerization domain allows formation of 
lig mers, which can be of any subunit stoichiometry (of 
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c urse greater than one). Thus, the term oligomerization 
domain is m re general and encompasses both dimerization 
and tetramerization domains (which direct formation of 
oligomers of subunit stoichiometrics 2 and 4, respectively). 

The term chimeric protein refers to a protein containing 
sequences from two different proteins, for example from p53 
and GCN4. 

AL p53 Associated with a Heterologous Dimerization 
Domain 

In one embodiment, a protein of this invention is com- 
prised of a p53 protein bearing a partial functional inacti- 
vation of its tetramerization domain and a heterologous 
dimerization domain. Thus, certain regions of the p53 tet- 
ramerization domain must be maintained (so that the chi- 
meric protein can form tetramers, in spite of containing a 
heterologous dimerization domain), while other regions are 
inactivated (so that tetramerization is dependent on the 
heterologous dimerization domain). The p53 tetramerization 
domain maps to residues 322-355 of SEQ ID NO: 2 [Wang 
et al, (1994), cited above; Qore et al, (1994), cited above]. 
According to this invention, a disruption of the p53 tet- 
ramerization domain, involving residues 335-348 of SEQ 
ID NO: 2 or a subset of these residues, sufficienUy disrupts 
the function of this domain, so that it can no longer drive 
tetramerization with wild-type p53 or tumor-derived p53 
mutants. At the same time, however, introduction of a 
heterologous dimerization domain reestablishes the ability 
to form tetramers, which is mediated both by the heterolo- 
gous dimerization domain and by the residual tetrameriza- 
tion domain of p53. 

A heterologous dimerization domain is defined herein as 
a sequence of amino acids heterologous to p53 and capable 
of forming homodimers. One example of a dimerization 
domain is the leucine zipper (LZ) element. A leucine zipper 
has been defined as stretch of about 35 amino acids con- 
taining 4-5 leucine residues separated from each other by 
six amino acids [Maniatis and Abel (1989), Nature, 
341:24-25]. The leucine zipper occurs in a variety of 
eukaryotic DNA binding proteins, such as GCN4, C/EBP, 
c-Fos, c-Jun, c-Myc and c-Max. 

Heterologous dimerization domains may also be selected 
from other proteins, such as the retinoic acid receptor, the 
thyroid hormone receptor or other nuclear hormone recep- 
tors [Kurokawa et al, (1993), Genes Dev., 7:1423-1435] or 
from the yeast transcription factors Gal4 and HAP1 [Mar- ■ 
monstein et al, (1992), Nature, 356:408-414; Zhang et al, 
(1993), Proc. Natl. Acad. Sci. USA, 90:2851-2855]. One of 
skill in the art may identify additional suitable dimerization 
domains, including artificial dimerization domains [O'Shea 
et al, (1992), Cell, 68:699-708; Krylov et al, (1994), EMBO 
J., 13:2849-2861]. For ease in description, the leucine 
zipper of the yeast transcription factor GCN4 is used herein 
as the exemplary dimerization domain. The nucleotide and 
amino acid sequences of GCN4 are presented as SEQ ID 
NO: 3 and NO: 4, respectively. The numbering of the GCN4 
nucleotide and amino acid residues follows Hinnenbusch 
(1984) Proc. Natl. Acad. Sci. USA, 81:6442-6446 and 
Ellenberger et al, (1992) Cell, 71:1223-1237, respectively. 
The coding region of GCN4 is encompassed by nucleotide 
778-1623 of SEQ ID NO: 3. The nucleotide and amino acid 
sequence are found in GenBank under the Code Yscgcn4. 

Partial functional inactivation of the p53 tetramerization 
domain can be accomplished by deletions, insertions and/or 
amino acid substitutions targeting part of this domain. Such 
mutations should involve residues 335-348 of SEQ ID NO: 
2 or a subset of these residues, but need not be confined 
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within the p53 tetramerization domain. For example, a 
deletion whose N-terminal boundary is within residues 
335-348 of SEQ ID NO: 2 may extend as far as the p53 
C-terminus. The precise boundaries of the mutations will 
depend on the nature of the heterologous dimerization 5 
domain and the presence, if any, of amino acid sequences 
introduced for cloning or other purposes between p53 and 
the heterologous dimerization domain. For example, in one 
preferred embodiment, residues 1-346 of human p53 [SEQ 
ID NO: 2] arc juxtaposed to the dimerization domain of 
GCN4 (residues 253-281 of GCN4 SEQ ID NO: 4) through 
a glutamic acid linker (FIG. IB). In another embodiment 
residues 1-347 of human p53 [SEQ ID NO: 2] are juxta- 
posed to residues 253-281 of GCN4 [SEQ ID NO: 4] (FIG. 
1C). 

Alternatively, the function of the p53 tetramerization 
domain may be partially disrupted by insertion of the 
heterologous dimerization domain within the p53 tetramer- 
ization domain and preferably between residues 335 and 348 ^ 
of human p53 [SEQ ID NO: 2]. In a preferred embodiment 
(FIG. ID) a glutamic acid and residues 253-281 of GCN4 
[SEQ ID NO: 4] are inserted between residues 346 and 347 
of human p53 [SEQ ID NO:2], Alternatively, the function of 
the p53 tetramerization domain may be partially disrupted ^ 
by insertions, deletions or amino acid substitutions, while 
the heterologous dimerization domain is inserted outside the 
boundaries of the p53 tetramerization domain. The muta- 
tions should again target residues 335-348 of human p53 
[SEQ ID NO: 2], or a subset thereof. In one embodiment, the ^ 
function of the p53 [SEQ ID NO: 2] tetramerization domain 
is inactivated by substitution of residue 344 by alanine. This 
mutation only partially disrupts the function of the p53 
tetramerization domain (see Examples section). A heterolo- 
gous dimerization domain can then be inserted even outside 35 
the p53 tetramerization domain, for example following 
residue 356 of human p53 [SEQ ID NO: 2], to reestablish 
tetramer formation (FIG. IE), 

At least two novel features characterize the class of 
proteins described here. First these chimeric proteins form 40 
tetramers. This was unexpected because the disruption in the 
p53 tetramerization domain is of sufficient magnitude to 
disrupt p53 tetramers into monomers. Yet, when the heter- 
logous dimerization domain is introduced, the chimeric 
protein forms tetramers, rather than dimers, as would be 45 
expected A second novel feature of these chimeric proteins 
is that their ability to form tetramers with wild-type p53 or 
with tumor-derived p53 mutants is greatly reduced. This is 
surprising, because these proteins must utilize p53 structural 
determinants to form tetramers (recall that in the invention 50 
a heterologous dimerization domain is juxtaposed to the p53 
sequence). For example the chimeric protein of FIG. IB that 
retains residues 1-346 of human p53 [SEQ ID NO: 2], 
retains all the critical residues (Gly 334, Arg 337, Phe 341 
and Leu 344 of SEQ ID NO: 2) mat are required to make the 55 
inter-subunit contacts for tetramer formation (see Example 3 
and Clore et al, (1994), cited above). Yet this chimeric 
protein fails to form tetramers with wild-type p53 or tumor- 
derived p53 mutants. 

The specific embodiments described in this section dis- 60 
play the desired functional characteristics, namely of form- 
ing tetramers, binding DNA and activating transcription 
equivalently to wild-type p53, and in addition fail to asso- 
ciate or associate very weakly with tumor-derived p53 
mutants (see Examples section). Additional examples of p53 65 
proteins of this invention may be generated by one of skill 
in the art given the teachings contained herein. 



A2. p53 Associated with a Heterologous Tetramerization 
Domain 

In this embodiment of the invention, the p53 protein bears 
a partial or preferably a complete functional inactivation of 
its tetramerization domain and contains a heterologous tet- 
ramerization domain. 

A heterologous tetramerization domain is defined as a 
sequence of amino acids heterologous to p53 and capable of 
forming stable homo-tetramers. Exemplary suitable tet- 
ramerization domains include that of the lac repressor, or an 
artificial tetramerization domain, such as variants of the 
GCN4 leucine zipper that form tetramers [Alberti et al, 

(1993) , EMBO J., 12:3227-3236; Harbury et al, (1993), 
Science, 262:1401-1407; Krylov et al, (1994), cited above]. 
One of skill in the art could readily select alternate tetramer- 
ization domains. For ease in description, the tetrameric 
variant of the GCN4 leucine zipper [Harbury et al, (1993), 
cited above] is used herein as the exemplary tetramerization 
domain. This variant has isoleucines at positions d of the 
coiled coil and leucines at positions a, in contrast to the 
original zipper which has leucines and valines, respectively 
[Harbury et al, (1993), cited above]. The nucleotide and 
amino acid sequences of this tetrameric leucine zipper 
variant are presented in the context of the full-length 
sequences, as SEQ ID NO: 5 and NO: 6, respectively. The 
numbering of the amino acid residues follows Ellenberger el 
al, (1992) [cited above]. 

The insertion of the tetramerization domain in the p53 
chimeric protein can be quite liberal, provided the functions 
of the transcription activation (also known as transactiva- 
tion) and DNA binding domains are not disrupted. Prefer- 
ably, the heterologous tetramerization domain would be 
inserted C-terminally to residue 290 of human p53 [SEQ ID 
NO: 2], since this maintains the integrity of both the 
transactivation and DNA binding domains. Functional inac- 
tivation of the p53 tetramerization domain can be accom- 
plished by deletions, insertions and/or amino acid substitu- 
tions. Such mutations should involve residues 322-355 of 
SEQ ID NO: 2, or a subset of these residues, since the p53 
tetramerization domain maps to these residues [Wang et al, 

(1994) , cited above; Clore et al, (1994), cited above]. 
Desirably, selected mutations target residues 328-348 of 
human p53 [SEQ ID NO: 2], or a subset thereof. Within this 
region the most critical residues for tetramer formation are 
residues 337, 341 and 344 of SEQ ID NO: 2. However, 
mutation of other residues within the regions indicated 
above can disrupt tetramer formation. While the mutations 
should involve residues 322-355 [SEQ ID NO: 2], or a 
subset thereof, they need not be confined within the p53 
tetramerization domain. Thus, they can extend as far N-ter- 
minally as residue 290 of human p53 [SEQ ID NO: 2] or as 
far as the p53 C-terminus (residue 393 of SEQ ID NO: 2). 
In addition to the types of mutations just described, func- 
tional inactivation of the p53 tetramerization domain can be 
accomplished by inserting the heterologous tetramerization 
domain within residues 322-355 of human p53 [SEQ ID 
NO: 2], and preferably within residues 328-348 of SEQ ID 
NO: 2. 

Embodiments of the invention are presented in FIGS. 
1F-1K. In one embodiment, the chimeric protein comprises 
a p53 sequence spanning amino acids 1 to 334 of human p53 
[SEQ ID NO:2] fused to an asparagine linker and then to a 
tetrameric variant of GCN4 residues 249-281 [SEQ ID 
NO:6] (FIG. IF). In another embodiment, the chimeric 
protein comprises a p53 sequence spanning amino acids 1 to 
325 of human p53 [SEQ ID NO:2] fused to an arginine- 
glycine-asparagine linker [SEQ ID NO: 7] and then to a 
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tetrameric variant of GCN4 residues 249-281 [SEQ ID 
NO:6] (FIG. 1G). In another embodiment, the chimeric 
protein comprises a p53 sequence spanning amino acids 1 1 
323 of human p53 [SEQ ID NO:2] fused to an arginine- 
glycine-glycine-asparagine-proline-glutamic acid linker 
[SEQ ID NO: 8] and then to a tetrameric variant of GCN4 
residues 250-281 [SEQ ID NO:6] (FIG. 1H). In yet another 
embodiment the chimeric protein comprises a p53 sequence 
spanning from amino acid 1 to 300 of human p53 [SEQ ID 
NO:2] fused to a glycme- glycine- asparagine-glutarriine-ala- 
nine linker [SEQ ID NO: 9] and then to a tetrameric variant 
of GCN4 residues 250-281 [SEQ ID NO:6] (FIG. 1J). In 
another embodiment, the chimeric protein comprises a p53 
sequence spanning amino acids 1 to 325 of human p53 [SEQ 
ID NO:2] fused to an arginine-gly cine-asp aragine linker 
[SEQ ID NO: 7], a tetrameric variant of GCN4 residues 
249-281 [SEQ ID NO:6], an isoleucine linker and then to 
residues 352-393 of human p53 [SEQ ID NO:2] (FIG. IK). 

One might have expected that a heterologous tetramer- 
ization domain would not be able to substitute for the native 
p53 tetramerization domain, because the function of the 20 
tetramerization domain is not only to drive tetramerization, 
but also to position the subunits appropriately relative to one 
another, so that the p53 tetramer can align to the DNA site. 
More specifically, the tetrameric variant of the GCN4 leu- 
cine zipper would be expected to be a particularly unsuitable 25 
choice for a heterologous tetramerization domain, since it 
drives parallel subunit assembly [Harbury et al, (1993), cited 
above], while the native p53 tetramerization domain drives 
antiparallel assembly [Clore et al, (1994), cited above; 
Sakamoto et al, (1994), Proc. Natl. Acad. Sci. USA, 
91:8974-8978]. Nevertheless, the inventor observed that 
such chimeric proteins bound DNA as homotetramers with 
very high efficiency. Without wishing to be bound by theory, 
the inventor believes that while p53 subunits align antipar- 
allel in the absence of DNA, they adopt a parallel orientation 
upon DNA binding. Thus a heterologous tetramerization 
domain that drives parallel assembly of p53 subunits, such 
as the tetrameric variant of the GCN4 leucine zipper, is 
compatible with DNA binding. 

Because the proteins described in this section form 40 
homotetramers and maintain high affinity for the specific 
p53 DNA sites, but do not maintain the integrity of the native 
p53 oligomerization domain, they do not form hetero- 
tetramers with wild-type p53 or tumor-derived p53 mutants, 
and thus will display tumor suppressing activity even in 45 
cancer cells expressing high amounts of mutant p53. Addi- 
tional p53 proteins of this invention can be generated by one 
of skill in the art following the teachings herein. 

B. Modifications of p53 Proteins with Altered Tetramer- 
ization Domains 

In yet another embodiment, the p53 proteins described 
herein contain modifications. These modifications can be 
trivial (defined as having no effect on function) or beneficial 
(i.e. they improve upon some aspect of the protein), and can 55 
include deletions, insertions, amino acid substitutions and/or 
replacement of functional domains or regions of functional 
domains by functionally equivalent domains or regions of 
other proteins. Various modifications of the p53 proteins 
encompassed by the invention are illustrated in FIGS. 2A ^ 
through 2F. It is understood that the proteins of the invention 
may contain more than one of the modifications described 
below. 

Bl. Length of Sequence between the DNA Binding and 
Tetramerization Domains 

The following modification may be made in the context of 
wild-type p53 or in the context of the p53 proteins described 
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in sections V.A1 and V.A2 above. This modification restricts 
the DNA binding specificities of the above mentioned pro- 
teins and involves a change in the length of the sequence 
between the p53 DNA binding and tetramerization domains. 
This modification does not affect the ability of p53 to 
tetramerize, rather it affects the positioning of the DNA 
binding domains relative to one another in a p53 tetramer. 
Since the function of an oligomerization domain in general 
is not only to induce oligomerization, but also to bring 
together the subunits in an appropriate orientation, this 
modification is considered an alteration of the p53 tetramer- 
ization domain, as it affects a function of the tetramerization 
domain and involves sequences that are extensions of the 
p53 tetramerization domain. 

Changing the length of the sequences between the DNA 
binding and tetramerization domains can affect the DNA 
binding properties of wild-type p53 or of a chimeric p53 
protein of this invention both in terms of sequence speci- 
ficity and affinity for DNA. Such changes can therefore 
confer desired properties. 

Without wishing to be bound by theory, the inventor 
realizes that the tetramerization domain of p53 is the site at 
which four p53 subunits contact each other. Thus, the 
positioning of the four p53 DNA binding domains relative to 
each other is dependent on the length of the sequence 
between the C-terminal boundary of the DNA binding 
domain (residue 289 of human p53, [SEQ ID NO: 2]) and 
the N-terminal boundary of the tetramerization domain 
(residue 322 for human wild-type p53, [SEQ ID NO:2]). A 
long linker, such as the linker present in wild-type p53 (i.e., 
residues 289-322 of SEQ ID NO: 2) provides freedom in 
positioning the DNA binding domains relative to one 
another, which in turn allows p53 to bind to different types 
of DNA sites. A long linker, however, reduces the affinity for 
DNA, since it allows p53 to adopt multiple (informations, 
only one of which is compatible with a specific DNA site. A 
short linker, on the other hand, allows p53 to bind only to 
specific types of DNA sites, but the affinity for these sites is 
increased because p53 can adopt few alternate conforma- 
tions. 

The types of DNA sites that wild-type p53 recognizes are 
exemplified by the sites present in the regulatory sequences 
of the wafl and mdm2 genes. The wafl site contains four 
contiguous-pentanucleotides [El-DeiryDeiry et al, (1993), 
cited above]. The mdm2 site contains two pairs of contigu- 
ous pentanucleotides separated by more than thirty nucle- 
otides [Wu et al, (1993), cited above]. The nucleotide 
sequences of these, two sites are: -GAACA-TGTCC- 
CAACA-TGTTG- [SEQ ID NO: 10] and -GGTCA- 
AGTTG-ggacacgtccggcgtcggctgtcggaggagctaagtcc- 
TGACA-TGTCT- [SEQ ID NO: 11], respectively, with the 
specific pentanucleotide repeats in capital letters and demar- 
cated by hyphens. Thus, DNA sites recognized by wild-type 
p53 can have any number of nucleotides between 0 to about 
35 separating two pairs of specific contiguous pentanucle- 
otides. 

To illustrate the effect of changing the length of the 
sequence between the p53 DNA binding and tetramerization 
domains on DNA binding specificity, FIGS. 3A-3H illus- 
trate the effect of deletions between these two domains. FIG. 
3A shows a cartoon of a wild-type p53 tetramer bound to a 
DNA site with contiguous pentanucleotides. FIG. 3B shows 
the same p53 tetramer bound t a DNA site with a 20-30 
nucleotide insert between the 2 pentanucleotide pairs. From 
FIGS. 3A and 3B it is apparent that the naturally-occurring 
sequence between the tetramerization and DNA binding 
domains provides the flexibility for wild-type p53 to recog- 
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nize both types of DNA sites. In FIGS. 3C and 3D the 
sequences (linkers) between the tetramerization and DNA 
binding domains are shortened. This is performed by dele- 
tions within the region spanning residues 290-327 of human 
p53 [SEQ ID NO: 2], preferably inv lving more than 22 
amino acids. Such deletions in the context of wild-type p53 
limit the ability to position one pair of DNA binding 
domains close enough to the other pair. Therefore, DNA 
sites with contiguous pentanucleotides cannot be recognized 
(FIG. 3C). However, the same deletions do not limit the 
ability to recognize DNA sites with a 20-30 nucleotide 
insert between the two pentnucleotide pairs (FIG. 3D). For 
example, wild-type p53, and p53 mutants lacking residues 
290-297 of SEQ ID NO: 2 or 300-308 of SEQ ID NO: 2 or 
300-317 of SEQ ID NO: 2 or 300-321 of SEQ ID NO: 2 
bind to both types of DNA sites. But p53 mutants lacking 
residues 300-327 of SEQ ID NO: 2 or residues 290-297 of 
SEQ ID NO: 2 and 300-321 of SEQ ID NO: 2 bind only to 
DNA sites with a 20 nucleotide insert (see Example 3E). 

This type of modification achieves the same result as 
described above for wild-type p53 when the modification is 
made to the chimeric p53 proteins that retain the antiparallel 
subunit alignment of wild-type p53. Chimeric p53 proteins 
that contain a heterologous dimerization domain (Section 
V.A1 above) have their subunits aligned antiparallel. 

Modifications of the p53 chimeric proteins containing a 
tetramerization domain driving parallel subunit alignment, 
such as the p53 proteins containing the exemplary tetrameric 
variant of the GCN4 leucine zipper (Section V.A2 above) 
produce a different effect. A cartoon of the chimeric protein 
of FIG. IF binding to DNA sites of different types is shown 
in FIGS. 3E and 3F. This chimeric protein can bind to both 
types of DNA sites via flexibility in positioning its DNA 
binding domains. Deletions within the sequences between 
the DNA binding and tetramerization domains create the 
opposite effect than the one observed for wild-type p53. A 
short linker, preferably by the deletion of 22 or more amino 
acids between residues 290 and 334 of SEQ ID NO: 2, 
allows p53 chimeras with parallel tetramerization domains 
to recognize only the DNA sites with contiguous pentanucle- 
otides (FIGS. 3G and 3H). 

While the effect of changing the length of the sequences 
between the DNA binding and tetramerization domains was 
illustrated by introducing deletions between these domains, 
changes in the length can also be introduced by insertions. 
The inserted sequences are p53 or non-p53 sequences. It is 
most meaningful to introduce insertions in the context of 
p53 proteins of this invention with very short sequences (i.e., 
0 to about 12 amino acid residues) between the DNA binding 
and tetramerization domains, for example the protein of 50 
FIG. 1J, to expand the range of DNA sites they can recog- 
nize. Finally the length of the sequences between the DNA 
binding and tetramerization domains can be altered by 
changing the site of insertion of the heterologous oligomer- 
ization domain. 

The novelty of this type of modification relates both to the 
realization that wild-type p53 recognizes DNA sites with 20 
or more nucleotides between the two pairs of contiguous 
pentanucleotides, as well as to the observation that changes 
in the length of the sequences between the p53 DNA binding 
and tetramerization domains of p53 modulate its ability t 
bind to the different types of DNA sites. It has not been 
appreciated before that wild-type p53 can bind DNA sites, 
where the pairs of contiguous pentnucleotide repeats are 
separated by as many as 20 or more nucleotides. While it had 65 
been established that wild-type p53 can bind to the mdm2 
site, it was thought that this site actually contains two DNA 
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sites (each comprising four contiguous pentanucleotides), as 
indicated below [Wu et al, (1993), cited above]: -GGTCA- 
AGTTG-GGACA-CGTCC-ggcgtcggctgtcggag-GAGCTA- 
AGTCC-TGACA-TGTCT- [SEQ ID NO: 11]. The repeats 
are indicated by capital letters and are separated by hyphens. 
However, the underlined repeats contain mismatches from 
the DNA sites recognized by wild-type p53. Such mis- 
matches, even in the context of an otherwise optimal DNA 
site, completely abrogate DNA binding [Halazonetis et al, 
(1993), cited above; and the inventor's own additional 
unpublished observations]. Furthermore, one of the under- 
lined repeats is not even five nucleotides long. In addition, 
if p53 could recognize the underlined repeats, then two p53 
tetramers should be able to bind to the DNA fragment 
illustrated above, but the inventor has determined experi- 
mentally that only one p53 tetramer can bind. Thus, only the 
non-underlined repeats are recognized by wild-type p53 in 
the above DNA fragment, and they are separated by more 
than 30 nucleotides. See the Examples, which also demon- 
strate that wild-type p53 can bind DNA sites with more than 
20 nucleotides between the pairs of pentnucleotide repeats. 

The therapeutic significance of altering the DNA binding 
properties of the p53 chimeric proteins, or of wild-type p53, 
relates to the biological consequences of activation of the 
different p53 target genes. More specifically, induction of the 
waf 1 gene leads 10 tumor suppression [El-DeiryDeiry et al, 
(1993), cited above], and is thus desirable for cancer therapy. 
On the other hand, induction of the mdm2 gene leads to 
expression of the Mdm2 protein, which in turn downregu- 
lates the activity of p53 by masking its transactivation 
domain [Oliner et al, (1992), Nature, 358:80-83; Momand et 
al, (1992), Cell, 69:1237-1245; Oliner et al, (1993), Nature, 
362:857-860; Wu et al, (1993), cited above], and is thus 
undesirable for cancer therapy. Introducing appropriate 
changes in the lengths of the sequences between the DNA 
binding and tetramerization domains of wild-type p53 or 
chimeric p53 proteins of the invention, as described above, 
permits restriction of their DNA binding specificity. 

Thus, p53 proteins of this invention are constructed as 
described above that recognize only DNA sites with con- 
tiguous pentanucleotides or only DNA sites with 20 to 30 
nucleotide inserts between the two pentanucleotide pairs. 
Such a p53 protein of this invention that recognizes the wafl 
DNA site, but not the mdm2 DNA site, has the ability to 
suppress tumor growth, but is not subject to negative regu- 
latory feedback by MdmZ One exemplary p53 protein 
bearing a modification in the length of the sequences 
between the DNA binding and tetramerization domains is 
shown in FIG. 2B, i.e., deletion of residues 300-327 of SEQ 
ID NO: 2. 

B2. Substitutions of Functional Domains or Regions 
Thereof with Equivalent Domains or Regions of Other 
Proteins 

As an example of this type of modification, the p53 
transactivation (also known as transcription activation) 
domain, contained within amino acids 1-90 of human p53 
[SEQ ID NO: 2], is substituted with that of another protein, 
e.g., the herpes simplex virus protein VP 16, also known as 
a trans-inducing factor [Pellett et al, (1985), Proc. Natl. 
Acad. Sci. USA, 82:5870-5874]. The nucleotide sequence 
spanning VP16 nucleotides 2074-2307 [Pellett et al, (1985) 
cited above] is reported as nucleotides 1-234 in SEQ ID NO: 
12. The amino acid sequence of the VP16 HSV fragment 
from amino acid 402 through 479 [Pellett et al, (1985) cited 
above] is reported as amino acid 1-78 in SEQ ID NO: 13. 
See, also, GenBank Code Hell eg. 

One exemplary modified p53 protein of the invention is 
illustrated in FIG. 2C, in which amino acid residues 
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402-479 of VP16 [aa 1-78 of SEQ ID NO: 13] have 
replaced amino acid residues 3-80 of human p53 [SEQ ID 
NO:2]. An equivalent substitution has been reported to 
maintain p53 tumor suppressor function [Pietenpol et al, 
(1994), cited above]. The advantage of this substitution is 5 
that in certain tumors, overexpression of the Mdm2 protein 
suppresses p53-mediated transcription by masking its trans- 
activation domain [Oliner et al, (1992), cited above; 
Momand et al, (1992), cited above; Oliner et al, (1993), cited 
above]. When the transactivation domain of p53 is replaced 10 
by that of VP16, it is no longer inhibited by Mdm2, because 
Mdm2 does not suppress the transcriptional activity medi- 
ated by VP16 [Oliner et al, (1993), cited above]. 

Substitution of a functional domain can involve part of the 
domain. For example, amino acid residues 1-52 of p53 15 
[SEQ ID NO: 2] are sufficient to mediate interaction with 
Mdm2, but residues 18-52 of p53 [SEQ ID NO: 2] are not 
[Chen et al, (1993), Mol. Cell. Biol., 13:4107-4114]. Thus, 
replacement of only residues 1-18 of p53 [SEQ ID NO: 2] 
with sequences from the VP16 activation domain abolishes 20 
the p53-Mdm2 interaction. As another example, since within 
residues 1-52 of p53 [SEQ ID NO: 2], amino acids Leu 14, 
Phe 19, Leu 22 and Trp 23, are critical for interaction with 
Mdm2 [Lin et al, (1994), Genes Dev., 8:1235-1246], the 
substitution involves only these residues. 25 
B3. Nuclear Localization Signals 
Nuclear localization is required for p53 function [Shaul- 
sky et al, (1991), Oncogene, 6:2056-2065]. Wild-type p53 
contains three nuclear localization signals (NLS), all of 
which map to the C-terminus of wild-type p53 and specifi- 30 
cally to residues 316-325, 369-375 and 379-384 of p53 
[SEQ ID NO: 2] [Shaulsky et al, (1990), Mol. Cell. Biol., 
10:6565-6577]. Some of the p53 proteins of the invention 
lack one or more of these NLS. Thus, such proteins have an 
impaired ability to localize to the nucleus and consequently 
their function is impaired. It is therefore beneficial to rein- 
troduce NLS. Even p53 proteins that maintain all three NLS 
may benefit from introduction of additional NLS. Therefore 
an analog of the p53 proteins described above may contain 
a NLS fused to its N-terminus, or its C-terminus, or at the 
junction of the transactivation and DNA binding domains or 
at the junction of the DNA binding and tetramerization 
domains or elsewhere in the protein, as long as the function 
of the p53 protein is not disrupted by insertion of the NLS. 
FIG. 2D demonstrates the insertion of a NLS at the boundary " 
of the transactivation and DNA binding domains. The NLS 
may be that of p53 or of any other nuclear protein, such as 
the NLS of SV40 large T antigen which is comprised of 
amino acids proline-lysine-lysine-lysine-arginine-lysine-va- 
line [SEQ ID NO: 14] [Kalderon et al, (1984), Cell, 5( 
39:499-509]. Additional heterologous NLS are described by 
Shaulsky et al, (1990); (1991) [cited above]. 

B4. Substitution by Homologous Non-human p53 
Sequences 5; 

The amino acid sequence of p53 is conserved across 
species [Soussi et al, (1990), cited above], implying that 
function is also conserved. Indeed, analysis of xenopus and 
human p53 proteins has revealed no functional differences 
[Cox et al, (1994), Oncogene, 9:2951-2959]. Thus, human eo 
p53 sequences of the p53 [SEQ ID NO: 2] proteins of this 
invention can readily be replaced with the homologous 
non-human p53 sequences. 

The sequences of human p53 [SEQ ID NO: 2] and certain 
non-human p53 proteins have been aligned by Soussi et al, 65 
(1990), cited above. This alignment permits identification of 
regions that are homologous across species. For p53 species 
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that are not listed by Soussi et a], (1990), cited above, the 
alignment to the human p53 [SEQ ID NO: 2] sequences is 
btained by computer programs commercially availa ble and 
kn wn in the art, such as the program BESTFTT of the 
University of Wisconsin GCG package. 

The benefit of substituting human p53 [SEQ ID NO: 2] 
sequences with equivalent non-human sequences relates to 
the realization that interactions of p53 with specific cellular 
or viral proteins are species-specific. For example, human 
p53 is inactivated by the human Mdm2 protein [Oliner et al, 
(1993), cited above; Momand et al, (1992), cited above; Wu 
et al, (1993), cited above]. Non-human p53 sequences have 
lower or no affinity for the human Mdm2. Thus, p53 proteins 
of this invention that contain non-human p53 sequences are 
not susceptible to inhibition by Mdm2. 

One of skill in the art with resort to the teaching of this 
invention may readily select which non-human p53 
sequence to use, as well as the extent of the region for 
substitution. For example, the p53-Mdm2 interaction has 
been mapped to residues 1-52 of p53 [SEQ ID NO: 2], and 
more specifically to residues 14, 19, 22 and 2of SEQ ID NO: 
2 [Chen et al, (1993), cited above; Lin et al; (1994), cited 
above]. Thus, to eliminate the p53-Mdm2 interaction, N-ter- 
minal sequences within residues 1-52 of the p53 proteins of 
this invention are substituted with the homologous non- 
human p53 sequences. 

The species of p53 that can be used to substitute for the 
human p53 sequences can readily be selected by one of skill 
in the art Species, such as xenopus and trout, that diverge 
most from human p53 [Soussi et al, (1990), cited above] are 
preferred, although other species may also be selected. 

As an exemplary modification of this type, residues 3-80 
of human p53 [SEQ ID NO: 2] are substituted by the 
homologous xenopus sequence (FIG. 2E) to produce a 
modified p53 protein incapable of interacting with Mdm2. 
B5. Amino Acid Substitutions, Deletions and Insertions 
Other modifications of the p53 proteins described in this 
invention include amino acid substitutions, small deletions 
and small insertions. (Deletions and insertions within the 
sequences between the DNA binding and tetramerization 
domains are discussed in section Bl above.) These modifi- 
cations involve either the p53 sequences or the heterologous 
oligomerization domain sequences or both. The modifica- 
tions may enhance function or introduce a useful property. 
For example a modification may introduce a tag to optimize 
protein purification [Scopes (1994), Protein Purification, 
Principles and Practice, third edition, Springer- Verlag, New 
York], or may enhance expression and/or stability of a p53 
protein of the invention when expressed in vitro or in a 
patient. Modifications in the p53 fragment may enhance 
DNA binding and growth suppressing activities. Two such 
modifications have already been described: substitution of 
arginine 174 with glutamine or of arginine 175 with leucine 
(the numbering refers to human p53 [SEQ ID NO: 2]; in 
mouse p53 the corresponding residues are 171 and 172 of 
SEQ ID NO: 15, respectively) [Halazonetis and Kandil 
(1993), cited above; Li et al, (1994), Cell Growth Differen- 
tiation, 5:711-721]. 

Modifications in p53 may also affect interaction with 
cellular or viral proteins, for example, substitution of leucine 
14 of SEQ ID NO: 2 with glutamine and phenylalanine 19 
of SEQ ID NO: 2 with serine abolish the p53-Mdm2 
interaction [Lin et al, (1994), cited above]. Modifications in 
the heterologous olig merization domain may increase the 
stability of tetramer formation, for example, substitutions 
that stabilize ligomerization driven by leucine zippers are 
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known [Krylov et al, (1994), cited above; O'Shea et al, 
(1992), cited above]. As an exemplary modification of this 
type, residues 174 or 175 of human p53 [SEQ ID NO: 2] are 
substituted by glutamine or leucine, respectively (FIG. 2F) 
in a p53 chimeric protein of this invention. 

B6. Linkers between p53 and the Heterologous Oligo- 
merization Domain 

Another modification of the proteins of this invention is 
the presence of an amino acid or peptide linker between the 
p53 fragment and the heterologous oligomerization domain. 
In one embodiment of this invention (FIG. 1C), there is no 
linker between p53 and the GCN4 leucine zipper. In other 
embodiments however (FIGS. IB, IF and IK), there are 
glutamic acid or asparagine or isoleucine linkers, respec- 
tively. Linkers may be present for cloning convenience or to 
confer some useful property. For example, residues that 
stabilize specific secondary structure elements, such as 
ot-helices, are known [Richardson and Richardson (1988), 
Science 240:1648-1652]. Such residues can be introduced 
in the linkers to stabilize the heterologous oligomerization 
domains. For example the linkers glycine-asparagine, argi- 
nine-glycine-asparagine [SEQ ID NO: 7], arginine-glycine- 
glycine-asparagine-proline-glutarnic acid [SEQ ID NO: 8], 
glycine-glycine-asparagine-glutamine-alanine [SEQ ID NO: 
9] present in the examples shown in FIGS. IE, 1G and IK, 25 
1H and LI, respectively, are all designed to stabilize the 
N-terminus of the ot-helical heterologous oligomerization 
domain. A variety of other amino acid or peptide linkers may 
be used for the reasons discussed above, provided they do 
not interfere with the function of the p53 chimeric protein. 

C. Nucleic Acid Sequences Encoding p53 Proteins of the 
Invention 

The present invention further provides nucleic acid 
sequences encoding the proteins of this invention, which 
includes the proteins described in sections V.A and V.B 
above. In addition to the coding strand, the nucleic acid 
sequences of the invention include the complementary DNA 
sequence representing the non-coding strand, the messenger 
RNA sequence, the corresponding cDNA sequence and the 
RNA sequence complementary to the messenger RNA 
sequence. Variants of these nucleic acids of the invention 
include variations due to the degeneracy of the genetic code 
and are encompassed by this invention. Such variants may 
be readily identified and/or constructed by one of skill in the 45 
art. In certain cases specific codon usage may be employed 
to optimize expression. The above nucleotide sequences can 
be included within larger DNA or RNA fragments, or may 
be interrupted by introns. 

In another embodiment the nucleic acids encoding the p53 
proteins of the invention are present in the context of vectors 
suitable for amplification in prokaryotic or eukaryotic cells. 
Many such vectors are known [Ausubel et al, (1994), cited 
above] and many of these are commercially available. For 
example plasmids with bacterial or yeast replication origins 
allow amplification in bacteria or yeast, respectively. Such 
vectors allow the production of large quantities of nucleic 
acids encoding the proteins of the invention, which nucleic 
acids can be used for gene therapy or for expression of the 
p53 proteins of the invention. 

In yet another embodiment the nucleic acids encoding the 
proteins of the invention are present in the context of vectors 
suitable for expression in cell-free extracts or lysates or in 
prokaryotic or eukaryotic cells. Many such vectors are 
known [Ausubel et al, (1994), cited above] and many of 65 
these are commercially available. For example, the vector 
pGEM4 (Promega, Madison, Wis.) is suitable for expression 
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of the p53 proteins in cell-free lysates, while the vector 
pSV2 [Mulligan et al, (1992), cited ab ve] is suitable for 
expression in mammalian cells. Such vectors allow the 
production of the proteins of the invention in vitro for 
analysis of their functional properties or for delivery to 
patients. 
D. Gene Therapy 

The nucleic acid sequences of the invention may be 
inserted into a vector capable of targeting and infecting a 
desired cell, either in vivo or ex vivo for gene therapy, and 
causing the encoded p53 protein of this invention to be 
expressed by that cell. Many such viral vectors are useful for 
this purpose, e.g., adenoviruses, retroviruses and adeno- 
associated viruses (AAV) [Schreiber et al, (1993), Biotech- 
niques, 14:818-823; Davidson et al, (1993), Nature Genet- 
ics, 3:219-223; Roessler et al, (1993), J. Clin. Invest, 
92:1085-1092; Smythe et al, (1994), Ann. Thorac. Surg., 
57:1395-1401; Kaplitt et al, (1994), Nature Genetics, 
8:148-154]. There has already been success using viral 
vectors driving expression of wild-type p53 [Fujiwara et al, 
(1993), Cancer Res., 53:4129-^133; Fujiwara et al, (1994), 
Cancer Res., 54:2287-2291; Friedmann (1992), Cancer, 
70(6 Suppl): 1810-1817; Fujiwara et al, (1994b), Curr. 
Opin. Oncol., 6:96-105]. 

For use in gene therapy, these viral vectors containing 
nucleic acid sequences encoding a p53 protein of the inven- 
tion, are prepared by one of skill in the art with resort to 
conventional techniques (see references mentioned above). 
For example, a recombinant viral vector, e.g. an adenovirus, 
of the present invention comprises DNA of at least that 
portion of the viral genome which is capable of infecting the 
target cells operatively linked to the nucleic acid sequences 
of the invention. By "infection" is generally meant the 
process by which a virus transfers genetic material to its host 
or target cell. Preferably, the virus used in the construction 
of a vector of the invention is rendered replication-defective 
to remove the effects of viral replication on the target cells. 
In such cases, the replication-defective viral genome can be 
packaged by a helper virus in association with conventional 
techniques. 

Briefly, the vectors) containing the nucleic acids encod- 
ing a protein of the invention is suspended in a pharmaceu- 
tically acceptable carrier, such as saline, and administered 
parenterally (or by other suitable means) in sufficient 
amounts to infect the desired cells and provide sufficient 
levels of p53 activity to arrest abnormal cellular prolifera- 
tion. Other pharmaceutically acceptable carriers are well 
known to those of skill in the art. A suitable amount of the 
vector containing the chimeric nucleic acid sequences is 
between about 10 6 to 10 9 infectious particles per mL carrier. 
The delivery of the vector may be repeated as needed to 
sustain satisfactory levels of p53 activity, as determined by 
monitoring clinical symptoms. 

As desired, this therapy may be combined with other 
therapies for the disease or condition being treated. For 
example, therapy involving the administration of a vector 
capable of expressing a p53 protein of the invention is well 
suited for use in conjunction with conventional cancer 
therapies, including surgery, radiation and chemotherapy. 

Nucleic acid sequences driving expression of a p53 pro- 
tein of the invention may also be introduced by "carriers" 
other than viral vectors, such as liposomes, nucleic acid- 
coated gold beads or can simply be injected in situ [Fujiwara 
et al (1994b), cited above; Fynan et al, (1993), Proc. Natl. 
Acad. Sci. USA, 90:11478-11482; Cohen (1993), Science, 
259:1691-1692; Wolff et al, (1991), Biotechniques, 
11:474-485]. 
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E. Pharmaceutical C mpositions 
The proteins of this invention may also be formulated into 
pharmaceutical compositions and administered using a 
therapeutic regimen compatible with the particular formu- 
lation. Pharmaceutical compositions within the scope of the 
present invention include compositions containing a protein 
of the invention in an effective amount to have the desired 
physiological effect, e.g. to arrest the growth of cancer cells 
without causing unacceptable toxicity for the patient 

Suitable formulations for parenteral administration 
include aqueous solutions of the active compounds in water- 
soluble or water-dispersible form, e.g. saline. Alternatively, 
suspensions of the active compounds may be administered 
in suitable conventional lipophilic carriers or in liposomes. 

The compositions may be supplemented by active phar- 
maceutical ingredients, where desired. Optional antibacte- 
rial, antiseptic, and antioxidant agents in the compositions 
can perform their ordinary functions. 

The pharmaceutical compositions of the invention may 
further contain any of a number of suitable viscosity enhanc- 
ers, stabilizers, excipients and auxiliaries which facilitate 
processing of the active compounds into preparations that 
can be used pharmaceutical^. Preferably, these prepara- 
tions, as well as those preparations discussed below, are 
designed for parenteral administration. However, composi- 
tions designed for oral or rectal adrriinistration are also 
considered to fall within the scope of the present invention. 

Those of skill in the pharmaceutical art should be able to 
derive suitable dosages and schedules of administration. As 
used herein, the terms "suitable amount" or "effective 
amount" means an amount which is effective to treat the 
conditions referred to below. A preferred dose of a pharma- 
ceutical composition containing a protein of this invention is 
generally effective above about 0.1 mg p53 protein per kg of 
body weight (mg/kg), and preferably from about 1 mg/kg to 
about 100 mg/kg. These doses may be administered with a 
frequency necessary to achieve and maintain satisfactory 
p53 activity levels. Although a preferred range has been 
described above, determination of the effective amounts for 
treatment of each type of tumor or other condition may be 
determined by those of skill in the art. 

Dosage units of such pharmaceutical compositions con- 
taining the proteins of this invention preferably contain 
about 1 mg to 5 g of the protein. 
F. Therapeutic Indications 

The nucleic acids and proteins of the invention can be 
introduced into human patients for therapeutic benefits in 
conditions characterized by insufficient wild-type p53 activ- 
ity. As stated above, the nucleic acids of the invention may 
be introduced into the patient in the form of a suitable viral 
vectors (or by direct DNA delivery) to harness the patient's 
cellular machinery to express the proteins of the invention in 
vivo. Alternatively, the proteins of the invention may be 
introduced into the patient in appropriate pharmaceutical 
formulations as described above. 

As one example, the pharmaceutical compositions of this 
invention, containing a protein of the invention or a nucleic 
acid or a viral vector which express a protein of the 
invention in vivo, may be employed to induce the cellular 
defence to DNA damaging agents. Examples of DNA dam- 
aging agents include sunlight UV irradiation, as well as 
radiation and chemotherapeutics used for cancer treatment. 
By administering a suitable amount of a composition of this 
invention, patients may tolerate higher doses of such DNA 
damaging agents. 

Another therapeutic use of the compositions of this inven- 
tion is in inducing apoptosis of specific cells, such as 
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proliferating lymphocytes. According to this method of use, 
a suitable amount of an appropriate pharmaceutical compo- 
sition of this inventi n is administered to a subject to 
enhance the development of immune tolerance. This method 
may employ both in vivo and ex vivo modes of adminis- 
tration. Preferably, this therapy is useful as the sole treatment 
or as an accessory treatment to prevent transplant rejection, 
or to treat autoimmune diseases, e.g., systemic lupus eryth- 
rematosis, rheumatoid arthritis and the like. 

The pharmaceutical compositions of this invention may 
also be employed to restore p53 function in tumor cells. 
Introduction of p53 function in tumor cells leads to arrest of 
cell proliferation or to cell death [Finlay et al, (1989), cited 
above; Eliyahu et al, (1989), cited above; Baker et al, (1990), 
cited above; Mercer et al, (1990), cited above; Diller et al, 
(1990), cited above; Isaacs et al, (1991), cited above; Yon- 
ish-Rouach et al, (1993), cited above; Fujiwara et al, (1993), 
cited above]. In addition p53 function primes tumor cells to 
undergo cell death in response to DNA damaging agents 
currently used in cancer therapy [Lowe et al, (1993), cited 
above; Fujiwara et al, (1994), cited above; Fisher (1994), 
Cell, 78:539-542], Desirably, a suitable amount of the 
composition of this invention is administered systemically, 
or locally to the site of the tumor with or without concurrent 
adnumstration of conventional cancer therapy (i.e. DNA 
damaging agents). 

Additionally, the compositions of this invention may be 
administered in methods to suppress cell proliferation in 
diseases other than cancers, which are characterized by 
aberrant cell proliferation. Among such diseases are 
included psoriasis, atherosclerosis and arterial restenosis. 
This method is conducted by administering a suitable 
amount of the selected composition systemically or locally 
to the patient. 

VI. SPECIFIC EXAMPLES 

The present invention provides p53 proteins with modi- 
fications in the native p53 sequence. These modifications, 
which do not interfere with its native tumor-suppressor 
function, provide the protein with at least one of the fol- 
lowing functional characteristics: (1) the ability to bind 
DNA and activate transcription like wild-type p53, but to not 
hetero-oligomerize with wild-type p53 or tumor-derived p53 
mutants; and (2) restricted DNA binding specificity from an 
alteration in the way that the tetramerization domain orients 
the DNA binding domains of a p53 tetramer relative to one 
another. Exemplary p53 proteins of this invention, which 
demonstrate the aforementioned functional characteristics, 
are described in sections V.A and V.B, above. Additional 
functionally equivalent proteins can be constructed by one 
of skill in the art with resort to the teachings of sections V.A 
and V.B and the Examples below. 

The following examples illustrate preferred methods for 
preparing p53 proteins of the invention and characterizing 
their functional activities. These examples are illustrative 
only and do not limit the scope of the invention. As 
throughout the specification, numbering of the human p53 
[SEQ ID NO: 2] amino acid residues in the examples 
follows Zakut-Houri et al, (1994) [cited above], while 
numbering of the GCN4 residues follows Ellenberger et al, 
(1992) [cited above]. 
Example 1 — Recombinant Plasmids 
Standard cloning procedures were used to prepare the 
plasmids described below [Ausubel et al, (1994), cited 
above; Innis et al, (1990), cited above]. 
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A. Plasmid pGEMhump53wt 

Plasmid pGEMhump53wt encodes full-length human 
wild-type p53 [SEQ ID NOS: 1 and 2]. This plasmid was 
prepared by PCR [Innis et al, (1990), cited ab ve] using a 
human p53 cDNA, which is readily available to those 5 
practicing the art. The PCR procedure was designed to 
incorporate unique restriction sites within the coding 
sequence of human p53 [SEQ ID NO: 1]: Kpn I at codon 
218, Sst I at codon 299, Sst II at codon 333, Bst BI at codon 
338 and Sal I immediately following the termination codon. 10 
An Msc I site at codon 138 was eliminated. These changes 
did not alter the sequence of the encoded pS3, and were only 
performed to expedite construction of mutant proteins bear- 
ing altered tetramerization domains or point mutations asso- 
ciated with human cancer. T7ie PCR product of the human 15 
p53 cDNA was digested with Nco I and Sal I and cloned in 
the vector pGEM4 [Promega, Madison, Wis.], which was 
linearized with Eco RI and Sal I. Synthetic oligonucleotides 
were used to bridge the Eco RI site of the vector and the Nco 
I site at the initiation codon of p53. The entire nucleotide 2Q 
sequence of the Eco Rl-Sal I human p53 insert in plasmid 
pGEMhump53wt is presented as SEQ ID NO: 20 (the first 
nucleotide of the EcoRI site is numbered nucleotide 1). 
Plasmid, pGEMhump53wt, was used to generate all the p53 
mutants described below, as well as for expression of ^ 
wild-type p53 by in vitro translation. 

B. Plasmids of the pGEMhump53 Series Encoding p53 
Proteins with Altered Tetramerization Domains 

Bl. Plasmid pGEMhump53LZ346E 

A fragment encoding amino acids 253-281 [SEQ ED NO: 30 
4] of the yeast transcription factor GCN4 [SEQ ID NO:*4] 
[Hinnenbusch et al, (1984), cited above] was prepared by 
PCR of plasmid pSP64-GCN4 [Halazonetis et al, (1988), 
Cell 55:917-924]. The sequence of the 5' PCR primer 
GCAGAGGAGC AAAAGCTTGA AGACAAGGTT [SEQ 35 
ID NO: 21] incorporates a Hindlll site, while the sequence 
of the 3' primer CTTCAGGTCG ACTCAGCGTT CGC- 
CAACTA A TTTC [SEQ ID NO: 2] incorporates a termina- 
tion codon and a Sal I restriction site. The PCR fragment was 
blunt-ended at the Hind HI site and cloned into 40 
pGEMhump53wt linearized with Stu I and Sal I. The 
resultant plasmid, pGEMhump53LZ346E, encodes amino 
acids 1-346 of human p53 [SEQ ID NO: 2], a glutamic acid, 
and then amino acids 253-281 of GCN4 [SEQ ID NO: 4]. 

B2. Plasmid pGEMhump53LZ347 45 

A fragment encoding amino acids 253-281 [SEQ ID NO: 
4] of the yeast transcription factor GCN4 [SEQ ID NO: 4] 
[Hinnenbusch et al, (1984), cited above] was prepared by 
PCR of plasmid pSP64-GCN4 [Halazonetis et al, (1988), 5Q 
cited above]. The sequence of the 5' PCR primer ATGAG- 
GCCTT GGAAGACAAG GTTGAAGAAT TG [SEQ ID 
NO: 23] incorporates a Stu I site, while the sequence of the 
3' primer CTTCAGGTCG ACTCAGCGTT CGCCAAC- 
TAA TTTC [SEQ ID NO: 22] incorporates a termination 55 
codon and a Sal I restriction site. The PCR fragment was 
cloned into pGEMhump53wt linearized with Stu I and Sal I. 
The resultant plasmid, pGEMhump53LZ347, encodes 
amino acids 1-347 of human p53 [SEQ ID NO: 2] and 
amino acids 253-281 of GCN4 [SEQ ID NO: 4]. 

B3. Plasmid pGEMhump53LZ335Q 

A fragment encoding amino acids 253-281 of the yeast 
transcription factor GCN4 [SEQ ID NO: 4] [Hinnenbusch et 
al, (1984), cited above] was prepared by PCR of plasmid 
pSP64-GCN4 [Halazonetis et al, (1988), cited above]. The 65 
sequence of the 5' PCR primer GCAGAGGAGC 
AAAAGCTTGA AGACAAGGTT [SEQ ID NO: 21] incor- 
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porates a Hind III site, while the sequence of the 3' prime r 
CTTCAGGTCG ACTCAGCGTT CGCCAACTAA TTTC 
[SEQ ID NO: 22] incorporates a termination codon and a Sal 
I restriction site. The PCR fragment was blunt-ended at the 
Hind III site and cloned into pGEMhump53wt linearized 
with Sst II and Sal I. The Sst II site of the vector and the 
blunt-ended Hind III site of the PCR product were bridged 
by annealed synthetic oligonucleotides GGGCGTC [SEQ 
ID NO: 24] and GACGCCCGC [SEQ ID NO: 25]. The 
resultant plasmid, pGEMhump53LZ335Q, encodes amino 
acids 1-335 of human p53 [SEQ ID NO: 2], a glutamine, 
and then amino adds 253-281 of GCN4 [SEQ ID NO: 4]. 

B4. Plasmid pGEMhump53LZ343 

A fragment encoding amino acids 253-281 of the yeast 
transcription factor GCN4 [SEQ ID NO: 4] [Hinnenbusch et 
al, (1984), cited above) was prepared by PCR of plasmid 
pSP64-GCN4 [Halazonetis et al, (1988), cited above]. The 
sequence of the 5* PCR primer GCAGAGGAGC 
AAAAGCTTGA AGACAAGGTT [SEQ ID NO: 21] incor- 
porates a Hind III site, while the sequence of the 3' primer 
CTTCAGGTCG ACTCAGCGTT CGCCAACTAA TTTC 
[SEQ ID NO: 22] incorporates a termination codon and a Sal 
I restriction site. The PCR fragment was blunt-ended at the 
Hind III site and cloned into pGEMhump53wt linearized 
with Bst BI and Sal I. The Bst BI site of the vector and the 
blunt-ended Hind in site of the PCR product were bridged 
by annealed synthetic oligonucleotides CGAAATGTTC 
CGAGAGCGAA TGAAAC and GTTTCATTCG 
CTCTCGGAAC ATTT [SEQ ID NO: 26 and 27]. The 
resultant plasmid, pGEMhump53LZ343, encodes amino 
acids 1-343 of human p53 [SEQ ID NO: 2] and then amino 
acids 249-281 of GCN4 [SEQ ID NO: 4]; 

B5. Plasmid pGEMhump53TZ334N 

Synthetic oligonucleotides were used to generate a tet- 
rameric variant of the GCN4 leucine zipper. These oligo- 
nucleotides TATCCGCGGT AATCGTCTGA AACA- 
GATCGA AGACAAGTTA GAAGAAATCC 
TTTCG AAGCT CTATCACATC GAG and TTTGTCGACT 
CAACGTTCAC CCAATAATTT TTTGATGCGC 
GCTAACTCAT TCTCGATGTG ATAGAGCTTC G [SEQ 
ID NO: 28 and 29] were subjected to a PCR cycle in the 
absence of any additional DNA. The PCR product was 
digested with restriction endonucleases Sst II and Sal I and 
cloned into pGEMhump53wt linearized with Sst II and Sal 
I. The resultant plasmid, pGEMhump53TZ334N, encodes 
amino acids 1-334 of human p53 [SEQ ID NO: 2], an 
asparagine, and then the tetrameric zipper variant corre- 
sponding to amino acids 249-281 [SEQ ID NO: 6] of 
GCN4. 

B6. Plasmid pGEMhump53TZ323RGN 

A deletion of residues 324-332 of human p53 [SEQ ID 
NO: 2] was created within the Sst I-^Sst II fragment of 
plasmid pGEMhump53TZ334N by substituting the wild- 
type Sst I— Sst II fragment with a PCR fragment generated 
u sing th e oligonucleotide TTCTCCGCGG AGTGGTTTCT 
TCTTTGGCTG [SEQ ID NO: 30]. The resultant plasmid, 
pGEMhump53TZ323RGN, encodes amino acids 1-323 of 
human p53 [SEQ ID NO: 2], an arginine-glycine-asparagine 
tripeptide [SEQ ID NO: 7], and then the tetrameric zipper 
variant corresponding to amino acids 249-28 1 [SEQ ID NO: 
6] of GCN4. 

B7. Plasmid pGEMhump53TZ334GNPE 

This plasmid is a modification of plasmid 
pGEMhump53TZ334N. The Sst II-Sal I fragment of 
pGEMhump53TZ334N containing the tetrameric zipper was 
modified by PCR using the primer TATCCGCGGT 
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GGAAATCCTG AACTGAAACA GATCGAAGAC AAG 
[SEQ ID NO: 31]. The PCR fragment was cloned using the 
Sst II-Sal I sites into pGEMhump53TZ334N, replacing the 
original Sst II-Sal I fragment. The resultant plasmid, 
pGEMhump53TZ334GNPE, encodes amino acids 1-334 of 
human p53 [SEQ ID NO: 2], a glyrine-asparagine-proline- 
glutamic acid tetrapeptide [SEQ ID NO: 32] and the tet- 
rameric zipper variant corresponding to amino acids 
250-281 [SEQ ID NO: 6] of GCN4. 
B8. Plasmid pGEMhump53LZ346E352I 
This plasmid is a modification of plasmid 
pGEMhump53LZ346E. A Cla I restriction site was intro- 
duced just after the last codon of pGEMhump53LZ346E by 
PCR with the primer GTCATCGATG CGTTCGCCAA 
CTAATTTCTT [SEQ ID NO: 33]. A PCR fragment encod- 
ing residues 352-393 of human p53 [SEQ ID NO:2] con- 
taining a Cla I site at its 5' end was also generated using the 
primer ATGAGGCCTT GGAACTCATC GATGCCCAGG 
CTGGG of SEQ ID NO: 34. The latter fragment was cloned 
using the Cla I-Sal I sites into the modified (using the primer 20 
of SEQ ID NO: 33) pGEMhump53LZ346E vector. The 
resultant plasmid, pGEMhump53LZ346E352I, encodes 
amino acids 1-346 of human p53 [SEQ ID NO: 2], a 
glutamic acid, the leucine zipper corresponding to amino 
acids 253-281 of GCN4 [SEQ ID NO: 4], an isoleucine and 25 
then residues 352-393 of human p53 [SEQ ID NO: 2]. 
B9. Plasmid pGEMhump53TZ334N352I 
This plasmid is a modification of plasmid 
pGEMhump53TZ334N. A Cla I restriction site was intro- 
duced just after the last codon of pGEMhump53TZ334N by 
PCR with the primer TTTGTCGACT CAATCGATAC 
GTTCACCCAA TAATTTTTTG [SEQ ID NO: 35]. A PCR 
fragment encoding residues 352-393 of human p53 [SEQ ID 
NO:2] containing a Gal site at its 5' end was also generated 
using the primer ATGAGGCCTT GGAACTCATC GAT- 
GCCCAGG CTGGG of SEQ ID NO: 34. The latter frag- 
ment was cloned using the Cla I-Sal I sites into the modified 
(using the primer of SEQ ID NO: 35) 
pGEMhump53TZ334N vector. The resultant plasmid, 
pGEMhurnp53TZ334N352I, encodes amino acids 1-334 of 40 
human p53 [SEQ ID NO: 2], an asparagine, the tetrameric 
zipper variant corresponding to amino acids 249-281 [SEQ 
ID NO: 6] of GCN4, an isoleucine and then residues 
352-393 of human p53 [SEQ ID NO: 2]. 

BIO. Plasmids pGEMhump53H175, 

pGEMhump53Q334, P GEMhump53L337, 
pGEMhump53A341 and pGEMhump53A344 

Plasmids pGEMhump53H175 ( pGEMhump53Q334, 
pGEMhump53L337, pGEMhump53A341 and 
pGEMhump53A344 encode proteins that differ by one 
amino acid from wild-type human p53 [SEQ ID NO: 2]. 
Specifically, they substitute Argl75 with His, Gly334 with 
Gin, Arg337 with Leu, Phe341 with Ala and Leu344 with 
Ala of p53 SEQ ID NO: 2, respectively. These single amino 
acid substitutions were generated using standard recombi- 
nant techniques [Ausubel et al, (1994), cited above; Innis et 
al, (1990), cited above]. Substitution of Argl75 of SEQ ID 
NO: 2 with His is often associated with human tumors. 

B 1 1 . Plasmids pGEMhump53D290-297, 

pGEMhump53D290-297D300-321, pGEMhump53D300- 
308 , pGEMhump53D300 -317, pGEMhump53D300-321, 
pGEMhump53D30O-327, and pGEMhump53D364-393 

Plasmids pGEMhurap53D290-297, 
pGEMhump53D290-297D300-321 , pGEMhump53D300- 
308, pGEMhump53D300-317, pGEMhump53D300-321, 
pGEMhump53D300-327, and pGEMhump53D364-393 
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encode proteins that contain deletions within wild-type 
human p53 [SEQ ID NO: 2]. These deletions involve 
residues 290-297, 290-297 and 300-321, 300-308, 
300-317, 300-321, 300-327 and 364-393 of p53 SEQ ID 
NO: 2, respectively. These deletions were generated using 
standard recombinant techniques [Ausubel et al, (1994), 
cited above; Innis et al, (1990), cited above], 
G Plasmid pSV2hump53wt 

Plasmid pSV2hump53wt encodes full-length human 
wild-type p53 [SEQ ID NO: 2], and directs transcription of 
this protein in mammalian cells. The pSV2 vector has been 
previously described [Mulligan et al, (1981), Proc. Natl. 
Acad. Sci. USA, 78:2072-2076]. A pSV2 vector containing 
a human c-jun insert has also been described [Zhang et al, 
(1990), Proc, Nad. Acad. Sci. USA, 87:6281-6285], The 
c-jun insert was removed from the latter plasmid using Sal 
I and Bgl II restriction endonucleases, and the ends of the 
vector were blunted Into this vector a blunted Eco RI-Hind 
III p53 insert from pGEMhump53wt was cloned. 

D. Plasmids of the pSV2hump53 Series Encoding p53 
Proteins with Altered Tetramerization Domains 

Because plasmids pGEMhump53wt and pSV2hump53wt 
contain the same p53 insert, it is possible to use restriction 
sites that are common within the inserts of these plasmids, 
to transfer p53 subfragments from plasmids of the pGEM- 
hump53 series to pSV2hump53wt Specifically, it is possible 
to transfer, for example, Sst II-Sal I fragments encoding 
altered tetramerization domains into the pSV2hump53 vec- 
tor, and thus allow expression of p53 proteins of the inven- 
tion in mammalian cells. 

pSV2 vectors expressing most of the proteins described 
above have been constructed. The name of the p53 protein 
with altered tetramerization domain is retained from the 
pGEM to the pSV2 series. For example, transfer of the Sst 
II-Sal I fragment of pGEMhump53TZ334N to 
pSV2hump53wt, yields pSV2hump53TZ334N, which 
allows expression in mammalian cells of a p53 protein 
containing amino acids 1-334 of human p53 [SEQ ID NO: 
2], an asparagine, and then the tetrameric zipper variant 
corresponding to amino acids 249-281 [SEQ ID NO: 6] of 
GCN4. 

E. Reporter Plasmids lb Assay p53-Mediated Transcrip- 
tional Activity 

The reporter plasmids drive expression of alkaline phos- 
phatase in a p53 -dependent manner. Plasmids pEwafl-TK- 
SEAP, pBC.V4A-TK-SEAP and pBC-TK-SEAP have one 
copy each of double-stranded oligonucleotides Ewafl [SEQ 
ID NO: 16], BC.V4A [SEQ ID NO: 17] and BC [SEQ ID 
NO: 18], respectively cloned into the unique Eco RV site of 
pTK-SEAR These oligonucleotides contain p53 binding 
sites of different affinities. 

The sequence of oligonucleotide Ewafl (top strand) is: 
CCC-GAACA-TGTCC-CAACA-TGTTG-GGG [SEQ ID 
NO: 16], This oligonucleotide corresponds to the enhancer 
that drives p53-dependent transcription of the wafl gene 
[El-DeiryDeiry et al, (1993), cited above]. The sequence of 
oligonucleotide BC.V4A (top strand) is: TC-GAGCA- 
TGTTC-GAGCA-TGTTC-GAGCATGT [SEQ ID NO: 17], 
and the sequence of oligonucleotide BC (top strand) is: 
CC-GGGCA-TGTCC-GGGCA-TGTCC-GGGCATGT 
(SEQ ID NO: 18]. Oligonucleotides BC.V4A [SEQ ID NO: 
17] and BC [SEQ ID NO: 18] contain artificial sites recog- 
nized by p53. For the three sites indicated above, the specific 
pentanucleotide repeats recognized by p53 are demarcated 
by hyphens. 

Plasmid pTK-SEAP drives expression of a secreted form 
of alkaline phosphatase under the control of a minimal 
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thymidine kinase promoter [Halazonetis (1992), Anticancer 
Res., 12:285-292]. It contains no p53 binding site, and thus 
serves as a control. 
F. Plasmid pSV2gpt 

Plasmid pSV2gpt [Mulligan et al, (1981), cited above] 5 
drives expression of gpt in mammalian cells. In these studies 
it only serves to bring the total amount of transfected DNA 
to 30 ug, when necessary. Expression of gpt does not 
interfere with p53 function. 
Example 2 — In Vitro Translation and DNA Binding Assay 1° 
Plasmids of the pGEMhump53 series of Example 1 were 
used to produce in vitro transcribed mRNA according to 
standard procedures [Halazonetis et al, (1988), cited above]. 
The mRNA is subsequently translated in vitro using prefer- 
ably rabbit reticulocyte lysate (Promega, Madison, Wis.) 15 
[Halazonetis et al (1988), cited above]. In vitro translated 
p53 can be used directly for DNA binding, without further 
purification. 

Alternate strategies for expression of p53 for DNA bind- 
ing assays include expression in E. coli or in Sf9 insect cells 
using appropriate vectors (many are commercially avail- 
able) for expression in bacterial cells or baculovirus vectors, 
respectively. Lysates or extracts prepared from bacterial or 
insect cells are used without purification, or optimally, 
following partial or complete purification using standard 
protein purification techniques [Scopes (1994), cited above]. 

The in vitro translated proteins were assayed for DNA 
binding, as previously described [Halazonetis et al, (1993), 
cited above]. 30 

Briefly, as exemplified below, the in vitro translated 
protein is incubated with a radioactively labeled oligonucle- 
otide containing a p53 binding site in the presence of 
non-specific competitor DNA. The reaction mixture is incu- 
bated 20 min. at room temperature and directly loaded on a 35 
native 5% polyacrylamide electrophoresis gel. In this type of 
DNA binding assay free DNA migrates to the bottom of the 
gel, whereas p53/DNA complexes migrate more slowly. 
Thus, the presence of slowly migrating DNA, which can be 
detected by autoradiography, indicates p53 DNA binding 40 
[Halazonetis et al (1993), cited above; Halazonetis and 
Kandil (1993), cited above]. 

As non-specific competitor DNAs, the following were 
used: 0. 1 ug single-stranded oligonucleotide MI7 
[GAGAGCCCCAGTTACCATAACTACTCT, SEQ ID NO: 45 
36] and 0.05 ug double-stranded oligonucleotide TF3 [AT- 
CACGTGATATCACGTGATATCACGTGAT, SEQ ID NO: 
37] per reaction. 

A number of double-stranded oligonucleotides containing 
p53 binding sites were radioactively labeled for these 50 
experiments. These included oligonucleotides Ewafl, 
BC.V4A and BC [SEQ ID NOS: 16, 17 and 18, respec- 
tively], and oligonucleotide BC.S21. The sequence of 
BC.S21 is: TAT-GGGCA-TGTCC-TATATATATGCG- 
TATATATAT-GGGCA-TGTCC-TAT [SEQ ID NO: 19]. The 55 
pentanucleotide repeats, which are recognized by p53, are 
indicated by hyphens. These DNAs were radioactively 
labeled using 32P-Iabeled nucleotides, as described [Hala- 
zonetis et al, (1988), cited above]. 

Results using this assay are presented in Example 3, 60 
below. 

Example 3 — DNA Binding Activities of p53 Proteins 

with Altered Tetramerization Domains 
A. DNA Binding Activities of Wild-type Human p53 $5 
The ability of wild-type human p53 to recognize the DNA 

sites present in oligonucleotides Ewafl, BC.V4A and BC 



[SEQ ID NOS: 16, 17 and 18, respectively] has been 
previously demonstrated [El-DeiryDeiry et al, (1993), cited 
above; Halazonetis et al, (1993), cited ab ve]. Using the 
assay described in Example 2, wild-type p53 rec gnized all 
these DNAs. The highest signal was obtained using the BC 
ligonucleotide [SEQ ID NO: 18], while oligonucleotide 
Ewafl [SEQ ID NO: 16] gave the weakest signal. The 
intensity of the signal in this assay reflects the affinity of p53 
for the different DNA sites. The intensity of the signal using 
oligonucleotides BC.V4A [SEQ ID NO: 17] or Ewafl [SEQ 
ID NO: 16] was enhanced in the presence of 0.1 ug anti-p53 
antibody PAb421 [Oncogene Science, Uniondale, N.YJ. 
This antibody activates DNA binding of wild-type p53, by 
switching the conformation of the protein [Halazonetis et al, 
(1993), cited above; Halazonetis and Kandil (1993), cited 
above]. Binding to oligonucleotide BC [SEQ ID NO: 18] is 
quite potent, and very little further enhancement is observed 
following incubation with antibody PAb421. In addition to 
antibody PAb421, the conformation of p53 can be switched 
by aC-terminal truncation that removes residues 364-393 of 
human p53 [SEQ ID NO: 2] [Halazonetis and Kandil (1993), 
cited above; Hupp et al (1992), Cell, 71 :875-886; Hupp and 
Lane (1994), Current Biology, 4:865-875]. The C-termi- 
nally truncated p53 protein, p53D364-393, bound all three 
oligonucleotides with high affinity, comparable to wild-type 
p53 in the presence of PAb421. 

Binding of wild-type p53 to oligonucleotide BC.S21 was 
also examined. This oligonucleotide [SEQ ID NO: 19] 
contains two pairs of contiguous pentanucleotide repeats 
separated by 21 nucleotides. Wild-type p53 bound efficiently 
to this DNA, as indicated by a strong signal in the DNA 
binding assay described in Example 2, above. The signal 
was as strong as with oligonucleotide BC [SEQ ID NO: 18] 
(which represents the optimal p53 DNA site) and was not 
further enhanced by antibody PAb421. As discussed in the 
Section V.B1, oligonucleotide BC.S21 [SEQ ID NO: 19] 
does not match the consensus p53 DNA site. Thus, the 
ability of wild-type p53 to bind to oligonucleotide BC.S21 
[SEQ ID NO: 19] is a novel finding. 

B. DNA Binding Activities of p53L2346E, p53LZ347, 
p53TZ334N and p53TZ323RGN 

Proteins p53LZ346E, p53LZ347, p53TZ334N and 
p53TZ323RGN represent chimeric proteins of the invention, 
which are encoded by plasmids pGEMhump53LZ346E, 
pGEMhump53LZ347, pGEMhump53TZ334N and 
pGEMhump53TZ323RGN, respectively, described in 
Example 1. The ability of these proteins to bind oligonucle- 
otides Ewafl, BC.V4A and BC [SEQ ID NOS: 16, 17 and 
18, respectively] was examined using the assay described in 
Example 2. All of them bound to all three oligonucleotides. 
The signal intensities were overall comparable to those of 
wild-type p53 bound to the respective oligonucleotides in 
the presence of PAM21 or to those of p53D364-393 bound 
to the respective oligonucleotides. p53LZ347, p53TZ334N 
and p53TZ323RGN exhibited somewhat higher affinity for 
DNA, as compared to p53LZ346E. 

The DNA complexes of proteins p53LZ346E, p53LZ347, 
p53TZ334N and p53TZ323RGN comigrated with the DNA 
complex of wild-type p53 or the DNA complex of p53D364- 
393. Since migration on acrylamide gels depends on the 
molecular size of the migrating species [Hope and Struhl 
(1987), EMBO J., 6:2781-2784] this indicates that the 
complexes of wild-type p53, p53D364-393, p53LZ346E, 
p53LZ347, p53TZ334N and p53TZ323RGN with DNA 
have similar molecular sizes. Since wild-type p53 and 
p53D364-393 bind DNA as tetramers [Halazonetis and 
Kandil (1993), cited above; Cho et al, (1994), cited above; 
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Hupp and Lane (1994), cited above], then p53LZ346E, 
p53LZ347, p53TZ334N and p53TZ323RGN also bind DNA 
as tetramers. 

C. DNA Binding Activities of p53LZ335Q and p53LZ343 
Proteins p53LZ335Q and p53LZ343 are chimeric pro- 
teins of p53 with the GCN4 leucine zipper, encoded by 
plasmids pGEMhump53LZ335Q and 

pGEMhump53LZ343, respectively. Protein p53LZ343 was 
first described by Pietenpol et al, (1994) [cited above]. The 
abOity of these proteins to bind oligonucleotides BCV4A 
and BC [SEQ ID NOS: 17 and 18, respectively] was 
examined using the assay described in Example 2. Both 
bound to oligonucleotide BC [SEQ ID NO: 18] (the optimal 
p53 DNA site [Halazonetis et al, (1993), cited above], but 
neither bound oligonucleotide BC.V4A [SEQ ID NO: 17] (a 
suboptimal site). Thus, the DNA binding activities of these 
proteins are compromised relative to wild-type p53. In 
addition the complexes of p53LZ335Q ami p53LZ343 with 
oligonucleotide BC [SEQ ID NO: 18] migrate significantly 
faster than the corresponding complexes of wild-type p53 or 
p53D364-393. Thus, the molecular sizes of the complexes of 
p53LZ335Q and p53LZ343 are smaller than those of wild- 
type p53 or p53D364-393. Since wild-type p53 and 
p53D364-393 are tetramers [Halazonetis and Kandil (1993), 
cited above; Cho et al, (1994), cited above; Hupp and Lane 25 
(1994), cited above], p53LZ335Q and p53LZ343 are 
diniers. They cannot be monomers, because monomeric p53 
does not bind DNA [Halazonetis and Kandil (1993), cited 
above]. 

In conclusion the ability of p53 proteins with altered 30 
tetramerization domains to bind to the full panel of p53 DNA 
sites correlates with their ability to form tetramers. Further- 
more, p53LZ335Q and p53LZ343 are not proteins of this 
invention, since they fail to form tetramers. 

D. DNA Binding Activities of p53Q334, p53L337, 35 
p53A341 and p53A344 

Proteins p53Q334, p53L337, p53A341 and p53A344 are 
encoded by plasmids pGEMhump53Q334, 
pGEMhump53L337, pGEMhump53A34I and 
pGEMhump53A344, respectively, described in Example L 
The ability of these proteins to bind oligonucleotide BC 
[SEQ ID NO: 18] was examined using the assay described 
in Example 2. Proteins p53Q334, p53L337 and p53A341 
bound DNA very weakly, if at all. This finding indicates that 
these substitutions disrupt completely the function of the 
p53 tetramerization domain, because they map within the 
tetramerization domain [Wang et al, (1994), cited above; 
aore et al, (1994), cited above] and DNA binding by p53 
requires oligomerization [Halazonetis and Kandil (1993), 
cited above]. 

Protein p53A344 bound oligonucleotide BC [SEQ ID 
NO: 18]. However, its complex with DNA migrated signifi- 
cantly faster than the DNA complexes of wild-type p53 or 
p53D364-393. Thus, p53A344 is a dimer, rather than a 
tetramer. Thus, this single amino acid substitution partially 
disrupts the function of the p53 tetramerization domain. 

E. DNA Binding Activities of p53D290-297, p53D290- 
297D300-321, p53D300-308, p53D300-317, p53D300-321, 
and p53D300-327 

Proteins p53D290-297, p53D290-297D300-321, 
p53D300-308, p53D300-317, p53D300-321 and p53D30O- 
327 are encoded by plasmids pGEMhump53D290-297, 
pGEMhump53D290-297D300-321, pGEMhump53D300- 
308, pGEMhump53D30O-317, pGEMhump53D300-321 
and pGEMhump53D300-327, respectively, described in 
Example 1. The ability of these proteins to bind oligonucle- 
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otides BG and BC.S21 [SEQ ID NOS: 18 and 19, respec- 
tively] was examined using the assay described in Example 
2. Proteins p53D290-297, p53D30O-308, p53D300-317, and 
p53D300-321 bound both oligonucleotides BC and BC.S21 
[SEQ ID NOS: 18 and 19, respectively] with efficiencies 
paralleling that of wild-type p53. In addition the complexes 
of these proteins with DNA comigrated with the complexes 
of wild-type p53 with DNA. Thus, p53D290-297, p53D300- 
308, p53D300-317 and p53D300-321 exhibit DNA binding 
properties similar to wild-type p53. 

Proteins p53D290-297D300-321 and p53D300-327 
formed aberrant complexes with oligonucleotide BC [SEQ 
ID NO:. 18]. These complexes migrated very slowly and the 
signal intensity was very low. In contrast, the complexes of 
p53D29Q-297D300-321 and p53D300-327 with oligonucle- 
otide BC.S21 [SEQ ID NO: 19], were similar to those of 
wild-type p53. Proteins p53D290-297D300-321 and 
p53D300-327 have diminished ability to recognize oligo- 
nucleotide BC [SEQ ID NO: 18], but their ability to recog-r 
nize oligonucleotide BC.S21 [SEQ ID NO: 19] is intact. 
Example A — Iinmunoprecipitation of p53 Proteins 
For immunoprecipitation experiments the p53 proteins 
were in vitro translated, as described in Example 2, in the 
presence of 35S-methionine, so that they would be radio- 
actively labeled. Following in vitro translation, 3 ul of the 
lysate containing the translated protein(s) was incubated in 
30 ul DNA binding buffer [Halazonetis et al, (1993), cited 
above] for 20 minutes, at room temperature. Then 0.6 ug 
anti-p53 antibody PAb421 [Oncogene Science, Uniondale, 
N.Y.], 30 ul packed protein G-Sepharose beads [Pharmacia, 
Piscataway, N.J.] and 400 ul high-salt/EDTA immunopre- 
cipitation buffer [Halazonetis et al, (1993), cited above] 
were added. After 45 min incubation at room temperature on 
a rotator, the beads were washed three times with the 
high-salt/EDTA immunoprecipitation buffer and the proteins 
absorbed to the beads were eluted with SDS sample buffer 
and subjected to SDS-PAGE. 
The use of this assay is described in Example 5 below. 
Example 5— Hetero-Oligomerization Assay 
This assay was used to demonstrate that p53LZ346E, one 
of the proteins of the invention encoded by plasmid 
pGEMhump53LZ346E (Example 1), does not hetero-oligo- 
merize with p53 proteins having intact native p53 tetramer- 
ization domains, such as wild-type p53 and tumor-derived 
p53 mutants. 

Mid-type p53 and p53LZ346E were cotranslated in the 
presence of 35S-methionine. Simultaneous translation of the 
two proteins provides opportunity for the two different 
subunit types to form hetero-oligomers. After cotranslation 
was completed, the mixture was immunoprecipitated using 
antibody PAb421, as described in Example 4 above. 

The epitope of antibody PAb421 maps to residues 
373-381 of human p53 [SEQ ID NO: 2]. Therefore, PAb421 
recognizes wild-type p53, but not p53LZ346E Thus, if the 
two proteins hetero-oligomerize when cotranslated, then 
both will be precipitated by PAM21. If they do not hetero- 
oligomerize, then only wild-type p53 will be precipitated. 

Following SDS-polyacrylamide gel electrophoresis, a 
radioactive band migrating like wild-type p53 was noted. No 
band corresponding to p53LZ346E was observed. These 
results indicate that wild-type p53 and p53LZ346E do not 
hetero- ligomerize. As a positive control the ability of 
wild-type p53 and p53D364-393 to hetero-oligomerize 
using this assay was examined. Like p53LZ346E, p53D364- 
393 lacks the epitope for PAb421. However, p53D364-393 
has an intact native p53 tetramerization domain. When 
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c translated with wild-type p53, both wild-type p53 and 
p53D364-393 were precipitated by PAb421, indicating that 
wild-type p53 and p53D364-393 form hetero-oligomers. 

Example 6 — Transcriptional Activity Assay 

An assay for transcriptional activity entails introducing 
vectors expressing wild-type p53 or p53 proteins of the 
invention into cells together with a reporter plasmid that 
expresses a reporter marker in a p53-dependent manner. 
Preferably the cells are human tumor cells that do not 
express endogenous p53, so that the transcriptional activity 
can be evaluated without interference from endogenous 
wild-type or mutant p53. [See, e.g., Lin et al, (1994), cited 
above; Wu et al, (1993), cited above; Chen et al, (1993), 
Oncogene, 8:2159-2166; Chumakov et al, (1993), Onco- 
gene, 8:3005-3011; Unger et al, (1992), EMBO J, 
11:1383-1390; Kem et al, (1992), Science, 256:827-830; 
Pietenpol et al, (1994), cited above]. 

Transcriptional activity was assayed in Saos-2 human 
osteosarcoma cells [ATCC HTB 85]. These cells do not 
contain any endogenous p53, because both p53 alleles are 
deleted [Diller et al, (1990), Mol. Cell. Biol., 
10:5772-5781]. Thus, any transcriptional activity can be 
attributed to the transfected p53. Transcriptional activity was 
assayed as previously described [Halazonetis (1992), Anti- 
cancer Res., 12:285-292]. Briefly, plasmids expressing p53 
in mammalian cells (of the pSV2 series described in 
Example 1) were cotransfected with reporter plasmids (of 
the pTK-SEAP series described in Example 1) using the 
calcium phosphate technique [Halazonetis (1992), cited 
above]. Alkaline phosphatase activity, which reflects p53- 
mediated transcriptional activity, was assayed as previously 
described [Halazonetis (1992), cited above]. 

The use of this assay is described in Examples 7 and 8 
below. 

Example 7 — Comparison of Transcriptional Activities of 
Wild-type p53, p53LZ346E, p53LZ347, p53LZ335Q, 
p53LZ343 and p53TZ334N 

Wild-type p53, p53LZ346E, p53LZ347, p53LZ335Q, 
p53LZ343 and p53TZ334N were expressed in Saos-2 cells 
[ATCC HTB 85] by transfecting plasmids pSV2hump53wt, 
pSV2hump53LZ346E, pSV2hump53LZ347, 
pSV2hump53LZ335Q, pSV2hump53LZ343 and 
pSV2hump53TZ334N, respectively. Hie transcriptional 
activities of the expressed proteins were assayed using one 
or more of the reporter plasmids pBC-TK-SEAP, pBC. V4A- 
TK-SEAP and pEwafl-TK-SEAP, described in Example 1. 

Results of representative experiments are presented in 
Table 1 below. The units of transcriptional activity are 
relative, and comparisons should only be made within the 
same experiment For each transfection the amounts of 
transfected plasmid are indicated in ug. Where the total 
amount was less than 30 ug, then plasmid pSV2gpt 
(described in Example 1) was used to bring the total to 30 55 

TABLE 1 



Experiment 1 

Reporter Plasmid: 10 pg pBC.V4A-TK-SEAP 



pS3 Expression 
Plasmid (2 pg) 



Transcriptional 
Activity 



10 



15 



20 



25 



30 



35 



40 



45 



50 



pSV2hump53wt 
pSV2hump53LZ347 
pSV2hump53L2335Q 
pSV2hump53LZ343 



1700 
524 
18 
22 
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TABLE 1 -continued 



Experiment 2 



p53 Expression 




•Transcriptional 


Plasmid (3 jig) 




Activity 


pSV2hump53wt 




1990 


pSV2hump53LZ346E 


520 


Experiment 3 






p53 Expression 


Reporter 


Transcriptional 


Plasmid (15 pg) 


Plasmid (15 ug) 


Activity 


pSV2hump53wt 


pEwafl-TK-SEAP 


1640 


pSV2hump53TZ334N 


pEwafl-TK-SEAP 


1480 


pSV2hump53TZ334N 


pBGV4A-TK-SEAP 


2750 


pSV2hump53TZ334N 


PBC-TK-SEAP 


2550 



From the results of Table 1 it is apparent that wild-type 
p53 is able to activate transcription from all the reporter 
plasmids examined, including plasmid pEwaf 1 -TK-SEAP, 
which contains the weakest p53 binding site. The tetrameric 
p53 proteins of the invention: p53LZ346E, p53LZ347 and 
p53TZ334N, all exhibit transcriptional activity. In contrast 
the dimeric p53 chimeric proteins, such as p53LZ335Q and 
p53LZ343, do not exhibit transcriptional activity that is 
detectably above background in this assay, and are thus 
clearly inferior to the tetrameric proteins of this invention. It 
is possible that the dimeric proteins may exhibit higher 
transcriptional activity when grossly overexpressed, since 
Pietenpol et al, (1994) [cited above] have reported detectable 
transcriptional activity using protein p53LZ343. The inven- 
tor has been unable to reproduce this result, as the Experi- 
ments of Table I indicate. Even if p53LZ343 has detectable 
transcriptional activity when grossly overexpressed, our 
experiments indicate that the tetrameric p53 proteins of this 
invention are clearly superior to the dimeric proteins. 

Example 8— Ability of 1\imor-Derived p53 Mutants to 
Suppress the Transcriptional Activities of wild-Type p53 or 
Tetrameric D53 Proteins of the Invention 

Ibmor-derived p53 mutants are known to suppress the 
transcriptional activity of wild-type p53 by forming hetero- 
tetramers with wild-type p53 [Milner and Medcalf (1991), 
cited above; Bargonetti et al, (1992), cited above; Fanner et 
al, (1992), cited above; Kem et al, (1992), cited above]. The 
tetrameric p53 proteins of this invention do not hetero- 
tetramerize with tumor-derived p53 mutants, because the 
native p53 tetramerization domain is partially or completely 
disrupted (See Example 5). Consequently, the transcrip- 
tional activities of the tetrameric p53 proteins should not be 
inhibited by tumor-derived p53 mutants. 

To establish this, the transcriptional activities of the p53 
proteins of this invention in the presence of excess of a 
tumor-derived p53 mutant were compared to their transcrip- 
tional activities in the absence of the tumor-derived mutant. 
The tumor-derived p53 mutant is expressed in cells using a 
suitable vector (as described in Example 1). Preferably, the 
tumor-derived mutant should be in 5 to 20-fold or more 
excess relative to the p53 protein of the invention, which can 
be roughly achieved by transfecting into the cells 5 to 
20-fold or more excess vector expressing the tumor-derived 
p53 mutant over the vector expressing the protein of the 
invention. 

1\jmor-derived mutants, p53Hisl75 and p53Ser249 are 
suitable for these studies because they potently inhibit the 
transcriptional activity of wild-type p53. These mutants 
have hisddine at position 175 or serine at position 249 of 
human p53 [SEQ ID NO: 2], respectively. Other tumor- 
derived p53 mutants [Caron de Fromentel and Soussi 
(1992), Genes Chrom. Cancer, 4:114 15] can also be used, 
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as long as they potently inhibit the transcriptional activity f 
wild-type p53. 

Results of relevant Experiments are presented in Tfcble 2 
below. Suppression of transcriptional activity by the tumor- 
derived p53 mutant is presented as percent of residual 
transcriptional activity in the presence of the tumor-derived 
p53 mutant, as compared to the transcriptional activity in the 
absence of the mutant For each transfection the amounts of 
transfected plasmids are indicated in jig. Where the total 
amount was less than 30 ng (as in the absence of the 
tumor-derived p53 mutant), then plasmid pSV2gpt 
(described in Example 1) was used to bring the total to 30 
jig. For these experiments the tumor-derived p53 mutant His 
175 was used (described in Example 1). 

TABLE 2 



Experiment 1 

Reporter Plasmid: 10 \i% pBC-TK-SEAP 
Plasmid for Tumor-Derived p53 Mutant: 19 ug 
pSV2hump53H175 



p53 Expression 
Plasmid (1 ug) 



Residual Transcriptional 
Activity in the Presence 
of Tumor-Derived p53 Mutant 



pSV2hump53wt 
pSV2hump53LZ346E 



23% 
100% 



10 



15 



20 



25 



30 



TABLE 2-continued 



pSV2hump53LZ347 46% 
Experiment 2 

Reporter Plasmid: 10 jig pBCV4A-TK-SEAP 
Plasmid for Tumor-Derived p53 Mutant: 19 ug 
pSV2hump53H175 



pS3 Expression 
Plasmid (1 ug) 



Residual Transcriptional 
Activity in the Presence 
of Tumor-Derived p53 Mutant 



pSV2hump53wt 
pSV2hump53TZ334N 



2% 
80% 



From the results of the experiments presented in Table 2 
it is apparent that, in contrast to wild-type p53, the tran- 
scriptional activities of the p53 tetrameric proteins of the 
invention are not suppressed, or only minimally suppressed, 
by gross excess of a tumor-derived p53 mutant. 

Numerous modifications and variations of the present 
invention are included in the above-identified specification 
and are expected to be obvious to one of skill in the art Such 
modifications and alterations to the compositions and pro- 
cesses of the present invention are believed to be encom- 
passed in the scope of the claims appended hereto. 



SEQUENCE LISTING 

( 1 ) GENERAL INFORMATION: 

( i i i ) NUMBER OF SEQUENCES: 37 

( 2 ) INFORMATION FOR SEQ ZD NO:l: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 1317 buc pain 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: double 
( D ) TOPOLOGY: linear 

( s i ) MOLECULE TYPE: DNA (genomic) 

( i x ) FEATURE: 

( A ) NAME/KEY: CDS 
( B ) LOCATION: 136.. 13 14 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NChl: 

GTCT AG AGCC ACCGTCCAGG GAGCAGGTAG CTGCTGOGCT CCGGGGACAC TTTGCGTTCG 60 

GGCTGGG AG C GTGCTTTCCA CGACGGTGAC ACGCTTCCCT GGATTGGCAG CCAOACTGCC 120 

TTCCG GGTC A CTGCC ATG GAG GAG CCG CAG TCA GAT CCT AG C GTC GAG CCC 17 1 

Met Glu Glu Pro Gin Scr Asp Pro Scr Val Glu Pro 
1 5 10 

CCT CTG AGT CAG G A A AC A TTT TCA G AC CTA TGG AAA CTA CTT CCT GAA 219 
Pro Leu Scr Gin Glu Thr Phc Scr Asp Leu Trp Lys Leu Leu Pro Glu 
15 2 0 2 5 

AAC AAC OTT CTG TCC CCC TTG CCG TCC CAA GCA ATG GAT GAT TTO ATG 267 
Asn Asn Val Leu Scr Pro Leu Pro Scr Gin All Mel Asp Asp Leu Mel 
30 3 5 40 

CTO TCC CCG OAC GAT ATT GAA CAA TGG TTC ACT GAA GAC CCA GGT CCA 3 15 

Leu Scr Pro Asp Asp lie Glu Gin Trp Phc Thr Glu Asp Pro Gly Pro 
45 30 55 60 

GAT GAA OCT CCC AGA AT CCA GAG OCT GCT CCC CCC GTG CC CCT GCA 363 
Asp Glu Alt Pro Arg Mcl Pro Glu Alt Ala Pro Pro Val Ala Pro Ala 
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CCA GCA GCT 
Pro Ala Ala 



CTG TCA TCT 

Leu Scr Scr 
9 5 

TTC CGT CTG 

Phc Arg Leu 
1 1 0 

ACG TAC TCC 

Tbr Tyr Scr 
1 2 5 

TGC CCT GTG 

Cys Pro Va 1 



GTC CGC GCC 
V a 1 Arg Ala 



GTG AGG CGC 

V a 1 Arg Arg 
1 7 5 

GCC CCT CCT 

Ala Pro Pro 
1 9 0 

TAT TTG GAT 

Tyr Leu Asp 
20S 

GAG CCG CCT 

Glu Pro Pro 



ATG TGT A A C 
Met Cys Asu 



ACC AT C ATC 

T h r lie lie 
2 5 5 

AGC TTT GAG 

Scr Phc Glu 
2 7 0 

GAG OA A GAG 

Glu Glu Glu 
2 8 5 

CCA GOG AGC 

Pro Gly Scr 



CAG CCA A A G 
Gin Pro Lys 



CGT GGG CGT 

Arg Gly Arg 
3 3 5 

G A A CTC A A G 

Glu Leu Lys 
3 5 0 

CAC TCC AGC 

His Scr Scr 
3 6 5 

AAA AAA CTC 

Lys Lys Leu 



6 5 

CCT ACA CCG 

Pro Thr Pro 
8 0 

TCT GTC CCT 

Scr V a 1 Pro 



GGC TTC TTG 
Gly Pbc Leu 



CCT GCC CTC 

Pro Ala Leu 
1 3 0 

CAG CTG TGG 

Gin Leu Trp 
1 4 5 

ATG GCC ATC 

Me i Ala lie 
1 6 0 

TGC CCC CAC 

Cys Pro His 



CAG CAT CTT 
Gin His Leu 



GAC AGA AAC 

Asp Arg Asa 
2 1 0 

GAG GTT GGC 

Glu V a 1 Gly 
2 2 5 

AG T TCC TGC 

Scr Scr Cys 
2 4 0 

ACA CTG G A A 

Thr Leu Glu 



GTG CGT GTT 

V a 1 Arg V a 1 

A A T CTC CGC 

Asa Leu Arg 
2 9 0 

ACT A A G CGA 

Thr Lys Arg 
3 0 5 

A A G AAA CCA 

Lys Lys Pro 
3 2 0 

GAG CGC TTC 

Glu Arg Phc 



GAT GCC CAG 
Asp Ala Gin 



CAC CTG A AG 
His Leu Lys 
3 7 0 

ATG TTC A A G 
Mel Phc Lys 



GCG GCC CCT 

Ala Ala Pro 
8 5 

TCC CAG AAA 

Scr Gin Lys 
1 0 0 

CAT TCT GGG 

His Scr Gly 
1 1 5 

AAC A AG ATG 

Asn Lys Met 



GTT GAT TCC 
Val Asp Scr 



TAC A AG CAG 

Tyr Lys Glo 
1 6 5 

CAT GAG CGC 

His Glu Arg 
1 8 0 

ATC CGA GTG 

lie Arg Val 
1 9 5 

ACT TTT CGA 

Thr Phc Arg 



TCT GAC TGT 
Scr Asp Cys 



ATG GGC GGC 

Met Gly Gly 
2 4 5 

GAC TCC AGT 

Asp Scr Scr 
2 6 0 

TGT GCC TGT 

Cys Ala Cys 
2 7 5 

A AG AAA GGG 

Lys Lys Gly 



GCA CTG CCC 
Ala Leu Pro 



CTG GAT GGA 

Leu Asp Gly 
3 2 5 

GAG ATG TTC 

Glu Met Phc 
3 4 0 

GCT GGG A AG 

Ala Gly Lys 
3 5 5 

TCC AAA A A G 

Scr Lys Lys 



ACA G A A GGG 
Thr Glu Gly 



7 0 

GCA CCA GCC 
Ala Pro Ala 



ACC TAC CAG 
Thr Tyr Gin 



ACA GCC A AG 

Thr Ala Lys 
1 20 

TTT TGC CAA 

Phc C y i Gin 
1 3 5 

ACA CCC CCG 

Thr Pro Pro 
1 5 0 

TCA CAG CAC 

Scr Gin His 



TGC TCA GAT 
Cys Scr Asp 



G A A GGA A A T 

Glu Gly Asn 
2 00 

CAT AGT GTG 

His Scr Val 
2 1 5 

ACC ACC ATC 

Thr Thr lie 
2 3 0 

ATG AAC CGG 

Met Asn Arg 



GOT A A T CTA 
Gly Asn Leu 



CCT GGG AGA 

Pro Gly Arg 
2 8 0 

GAG CCT CAC 

Glu Pro His 
2 9 5 

AAC AAC ACC 

Asn Asn Thr 
3 1 0 

G A A TAT TTC 

Glu Tyr Pbc 



CGA GAG CTG 
Arg Glu Leu 



GAG CCA OGG 

Glu Pro Gly 
3 60 

GGT CAG TCT 

Gly Gin Scr 
3 7 5 

CCT GAC TCA 

Pro Asp Scr 



7 5 

CCC TCC TGG 

Pro Scr Trp 
9 0 

GGC AGC TAC 

Gly Scr Tyr 
1 0 5 

TCT GTA ACT 

Scr Val Thr 



CTG GCC A AG 
Leu Ala Lys 



CCC GGC ACC 

Pro Gly Thr 
1 5 5 

ATG ACG GAG 

Met Thr Glu 
1 7 0 

AGC GAT GGT 

Scr Asp Gly 
1 8 5 

TTG CGT GTG 

Leu Arg Val 



GTG GTG CCC 
Val Val Pro 



CAC TAC AAC 

His Tyr Asn 
2 3 5 

AGA CCC ATC 

Arg Pro lie 
2 5 0 

CTG OGA CGG 

Leu Gly Arg 
2 6 5 

GAC CGG CGC 

Asp Arg Arg 



CAC GAG CTG 
His Glu Leu 



AGC TCC TCT 

Scr Scr Scr 
3 1 5 

ACC CTT CAG 

Thr Leu Gin 
3 3 0 

A A T GAG GCC 

Asn Glu Ala 
3 45 

GGG AGC AGG 

Gly Scr Arg 



ACC TCC CGC 

Tbr Scr Arg 

GAC TGA 
Asp 



CCC 4 I 1 

P r o 



GGT 459 
G 1 y 



TGC 507 
Cys 



ACC 555 
Th r 
1 4 0 

CGC 603 
Arg 



GTT 6 5 1 

V a 1 



CTG 699 
Leu 



GAG 747 
G I u 



TAT 7 9 5 

Tyr 

2 2 0 

TAC 843 
Tyr 



CTC 891 
Leu 



AAC- 939 
Asn 



ACA 9 8 7 

Thr 



CCC 10 3 5 

P r o 
3 0 0 

CCC 10 8 3 

P r o 



ATC 113 1 

1 1 c 



TTG 117 9 

Leu 



GCT 12 2 7 

A 1 a 



CAT 12 7 5 

H i s 
3 8 0 

13 17 
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3 8 5 



3 9 0 



( 2 ) INFORMATION FOR SEQ ED NO:2: 

( i ) SEQUENCE CHARACTERISTICS: 

( A ) LENGTH: 393 unino acids 
( B ) TYPE: ammo acid 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: protein 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

Me l Glu Gla Pro Gin Scr Asp Pro 
1 5 

Glu Thr Phc Scr Asp Leu Trp Lys 
2 0 

Scr Pro Leu Pro Scr Gin Ala Met 
.3 5 4 0 

Asp lie Gin Gin Trp Phc Thr Glu 
5 0 5 5 

Arg Met Pro Glu Ala Ala Pro Pro 
6 5 7 0 

Thr Pro Als Ala Pro Ala Pro Ala 
8 5 

Va 1 Pro Scr Gin Lys Thr Tyr Gin 

1 00 

Phc Leu His Scr Gly Thr Ala Lys 

115 120 

Ala Leu Asn Lys Met Phc Cys Gin 
130 135 

Leu Trp Val Asp Scr Thr Pro Pro 
14 5 15 0 

Ala lie Tyr Lys Gin Scr Gin His 

1 65 

Pro His His Glu Arg Cys Scr Asp 
18 0 

His Leu lie Arg Val Glu Gly Asn 
195 200 

Arg Asn Thr Phc Arg His Scr Yal 
2 10 2 15 

Val Gly Ser Asp Cys Thr Thr lie 
22 5 2 3 0 

Scr Cys Mel Gly Gly Mel Asn Arg 

2 4 5 

Leu Glu Asp Scr Ser Gly Asn Leu 

2 6 0 

Arg Val Cys Ala Cys Pro Gly Arg 

275 280 

Leu Arg Lys Lys Gly Glu Pro His 
2 9 0 2 9 5 

Lys Arg Ala Leu Pro Asn Asn Thr 
3 0 5 3 10 

Lys Pro Leu Asp Gly Glu Tyr Phc 

3 2 5 

Arg Phe Glu Mc i Phc Arg Glu Leu 

3 4 0 



Scr Val Glu Pro Pro Leu Scr Gin 
10 15 

Leu Leu Pro Glu Asn Asn Val Leu 

2 5 3 0 

Asp Asp Leu Met Leu Scr Pro Asp 
4 5 

Asp Pro Gly Pro Asp Olu Ala Pro 
6 0 

Val Ala Pro Ala Pro Ala Ala Pro 

75 80 

Pro Scr Trp Pro Leu Scr Scr Scr 
9 0 9 5 

Gly Ser Tyr Gly Phc Arg Leu Gly 
10 5 110 

Ser Val Thr Cys Thr Tyr Scr Pro 

1 2 5 

Leu Ala Lys Thr Cys Pro Val Gin 
140 

Pro Gly Thr Arg Val Arg Ala Met 

155 160 

Met Thr Glu Val Val Arg Arg Cys 
17 0 17 5 

Ser Asp Gly Leu Ala Pro Pro Gin 
18 5 190 

Leu Arg Val Glu Tyr Leu Asp Asp 

2 0 5 

Val Val Pro Tyr Glu Pro Pro Glu 
22 0 

His Tyr Asn Tyr Met Cys Asn Scr 

2 3 5 2 4 0 

Arg Pro lie Leu Thr lie lie Thr 

2 5 0 - 2 5 5 

Leu Gly Arg Ain Scr Phc Glu Vil 
265 270 

Asp Arg Arg Thr Glu Glu Glu Asn 
2 8 5 

His Glu Leu Pro Pro Gly Scr Thr 
3 0 0 

Scr Scr Ser Pro Gin Pro Lys Lys 

3 15 3 2 0 

Thr Leu Gin lie Arg Gly Arg Glu 

3 3 0 3 3 5 

Asn Glu Ala Leu Glu L c, u Lys Asp 

3 4 5 3 5 0 
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Ala Gin Al i Gly Ly « Gl u Pro Gl y Gly Scr Arg Ala His Scr Set His 
3 5 5 360 365 

Leu Lys Scr Lys Lys Gly Gin Scr Thr Scr Arg His Lys Lys Leu Met 
370 375 380 

Pbc Lys Thr Gl u Gly Pro Asp Scr Asp 
3 8 5 3 9 0 

( 2 ) INFORMATION FOR SEQ ID NO:3: 

( i ) SEQUENCE CHARACTERISTICS: 

( A ) LENGTH: 1824 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: double 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (genomic) 

( i x ) FEATURE: 

( A ) NAME/KEY: CDS 
( B ) LOCATION: 778.. 1623 

( x i ) SEQUENCE DESCRIPTION: SEQ ID N03: 

ATCTTCGGGG ATATAAAGTG CATGAGCATA CATCTTGAAA AAAAAAGATG AAAAATTTCC 60 

GACTTTAAAT ACGGAAGATA AATACTCC A A CCTTTTT TTC CAATTCCOAA ATTTTAGTCT 120 

T CTTT A A AG A AGTTTCGGCT CGCTGTCTTA CCTTT T A A A A TCTTCTACTT CTTG AC AG T A 180 

C TT AT C TTC T TATATAATAG ATATACAAAA C AAA AC AAA A CAAAAACTCA CAACACAGGT 240 

TACTCTCCCC CCTAAATTCA AATTTTTTTT GCCCATCAGT T TC ACT AGCG AATTATACAA 300 

CTCACCAGCC A C AC AGCTC A CTCATCTACT TCGCAATCAA AACAAAATAT TTTATTTTAG 360 

TTCAGTTTAT TAAGTTATTA TCAGTATCGT ATTAAAAAAT TAAAGATCAT TG A A A A ATGG 420 

CTTGCTAAAC CGATTATATT TTGTTTTTAA AGTAGATTAT TATTAGAAAA TTATTAAGAG 480 

AATTATGTGT T A A ATTTATT GAAAGAGAAA ATTTATTTTC CC TT ATT A AT TAAAGTCCTT 540 

TACTTTTTTT G A A AACTGTC AGTTTTTTGA AOAGTTATTT GTTTTGTTAC CAATTGCTAT 600 

CATGTACCCG TAGAATTTTA TTC A AG ATGT TTC CG T A AC G GTTACCTTTC TGTC A A ATTA 660 

TCCAGOTTTA CTCGCCAATA AAAATTTCCC TATACTATCA T T A A TT A A A T CATTATTATT 720 

ACTAAAGTTT TGTTTACCAA TTTGTCTGCT CAAGAAAATA AATTAAATAC A A A T A A A 777 

ATG TCC G A A TAT CAG CCA AGT TTA TTT GCT TTA A A T CCA AT G GGT TTC 825 
Met Scr Glu Tyr Gin Pro Scr Leu Pbc Ala Leu Asn Pro Met Gly Pbc 
1 5 10 15 

TC A CCA T TG GAT GGT TCT AAA TCA ACC AAC G A A A A T GTA TCT GCT TCC 873 
S.cr Pro Leu Asp Gly Scr Lys Scr Thr Asn Glu Asn Val Scr Ala Scr 
2 0 2 5 3 0 

ACT TCT ACT GCC AAA CCA ATG GTT OGC CAA TTG ATT TTT GAT AAA TTC 921 
Thr Scr Thr Ala Lys Pro Met Val Gly Gin Leu lie Pbc Asp Lys Pbc 
3 5 4 0 4 5 

A TC A A G ACT G A A GAG GAT CCA ATT A TC AAA CAG GAT ACC CCT TCG AAC 969 
tic Lys Thr Glu Glu Asp Pro 11c lie Lys Gin Asp Tbr Pro Scr Asn 
5 0 5 5 6 0 

C TT GAT TTT GAT TTT GCT CTT CCA CAA ACG GCA ACT OCA CCT OAT GCC 1017 
Leu Asp Phc Asp Phc Ala Leu Pro Gin Thr Ala Tbr Ala Pro Asp Ala 
65 70 75 80 

A AG ACC GTT TTG CCA ATT CCG GAG C T A GAT GAC GCT GTA GTG G A A TCT 1065 
Lys Thr Val Leu Pro lie Pro Glu Leu Asp Asp Ala Val Val Glu Scr 
8 5 9 0 9 5 

TTC TTT TCG TCA AGC ACT GAT TCA ACT CCA ATG TTT GAG TAT G A A AAC 1113 
Pbc Pbc Scr Scr Scr Thr Asp Scr Tbr Pro Met Pbc Glu Tyr Glu Asn 
10 0 10 5 110 
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CTA AA G AC AAC TCT AAA GAA T G ACA TCC TTG TTT G AC A AT GAC ATT 1161 
Leo 01 u. Asp Aid Scr Lys 01 u Trp Thr Sci Leu Phc Asp Asn Asp lie 
115 12 0 12 5 

CCA GTT ACC ACT GAC GAT GTT TCA TTG GCT GAT A AG GCA ATT GAA TCC 1209 
Pro Val Tbr Thr Asp Asp Vil Scr Leu Ala Asp Lys Ala lie Glu Scr 
13 0 13 5 14 0 

ACT GAA GAA GTT TCT CTG GTA CCA TCC A A T CTG GAA GTC TCG ACA ACT 1257 
Thr Glu Glu Val Scr Leu Val Pro Scr Asn Leu Glu Val Scr Tbr Thr 
145 150 155 160 

TCA TTC TTA CCC ACT CCT GTT CTA GAA GAT GCT AAA CTG ACT CAA ACA 1305 
Scr Phc Leu Pro Thr Pro Val Leu Glu Asp Ala Lys Leu Thr Gin Thr 
16 5 17 0 17 5 

AGA A AG GTT A AG AAA CCA A AT TCA GTC GTT A AG A AG TCA CAT CAT GTT 1353 
Arg Lys Val Lys Lys Pro Asn Scr Val Val Lys Lys Scr His His Val 
18 0 18 5 19 0 

GGA A AG GAT GAC GAA TCG AGA CTG GAT CAT CTA GGT GTT GTT GCT TAG 1401 
Oly Lys Asp Asp Glu Scr Arg Leu Asp His Leu Gly Val Val Ala Tyr 
195 200 205 

AAC CGC AAA CAG COT TCG ATT CCA CTT TCT CCA ATT GTG CCC GAA TCC 1449 
Asn Arg Lys Gin Arg Scr lie Pro Leu Scr Pro lie Val Pro Glu Scr 
2 10 2 15 2 2 0 

AGT GAT CCT GCT GCT CTA AAA CGT GCT AGA AAC ACT GAA GCC GCC AGG 1497 
Scr Asp Pro Ala Ala Leu Lys Arg Ala Arg Asn Thr Glu Ala Ala Arg 
225 230 235 240 

CGT TCT CGT OCG AGA A AG TTG CAA AGA ATG AAA CAA CTT OA A GAC A AG 1545 
Arg Scr Arg Ala Arg Lys Leo Oln Arg Mcl Lys Gin Leu Glu Asp Lys 
245 250 255 

GTT GAA GAA TTG CTT TCG AAA A AT TAT CAC TTG GAA A A T GAG GTT GCC 1593 
Val Glu Ql o Leu Leu Scr Lys Asn Tyr His Leu Glu Asn Glu Val Ala 
260 265 270 

AGA TTA A AG AAA TTA GTT OGC GAA CGC TGATTTCATT TACCTTTTAT 1640 
Arg Leu Lys Lys Leu Val Gly Glu Arg 
2 7 5 2 8 0 

TTT AT ATTTT TTATTTCATT CTCGTGT AT A ACG A AATAGA TACATTCACT TAGATAAGAA 1700 

TTT A A TCTTT TTTATGCC A A TTTTCTTAAG TAGAATTTTA CACCACGCAT TTATAATCTG 1760 

CCGTATGTTC TGGTATTTAC TGGTT AGG A A TAGATAAAAA A A AC ACTC AC GATGGGGGTC 1820 

G A AC 1824 

( 2 ) INFORMATION FOR SEQ ID NO:4: 

( i ) SEQUENCE CHARACTERISTICS: 

( A ) LENGTH: 281 amino acids 
( B ) TYPE: amino acid 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: protein 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

Met Scr Glu Tyr Gin Pro Scr Leu Phc Ala Leu Asn Pro Met Gly Phc 
i 5 10 15 

Scr Pro Leu Asp Gly Scr Lys Scr Thr Asn Glu Asn Val Scr Ala Scr 
2 0 2 5 3 0 

Tbr Scr Thr Ala Lys Pro Mcl Val Gly Gin Leu lie Phc Asp Lys Phc 
3 5 4 0 4 5 

lie Lys Thr Glu Glu Asp Pro lie lie Lys Gin Asp Thr Pro Scr Asn 
5 0 5 5 6 0 

Leu Asp Phc Asp Phc Ala Leu Pro Gin Tbr 'Ala Tbr Ala Pro Asp Ala 
65 70 75 80 

Lys Thr Val Leu Pro lie Pro Glu Leu Asp Asp Ala Val Val Glu Scr 
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8 5 9 0 9 5 

Phc Pbc Scr Scr Scr Thr Asp Scr Tbr Pro Mcl Pbc GIu Tyr Glu Asn 
10 0 10 5 110 

Leu Glu Asp Am Scr Lys Glu Trp Thr Scr Leu Phc Aj p Am Asp 11c 
115 12 0 12 5 

Pro Val Thr Thr Asp Asp Val Scr Leu Ala Asp Lys Ala lie Glu Scr 
130 13 5 140 

Thr Glu Glu Val Scr Leu Val Pro Scr Am Leu Glu Vol Scr Thr Thr 
145 150 155 160 

Scr Phc Leu Pro Thr Pro Val Leu Glu Asp Ala Lys Leu Tbr Gin Tbr 
16 5 17 0 17 5 

Arg Lys Val Lys Lys Pro Asn Scr Val Val Lys Lys Scr His His Val 
18 0 18 5 19 0 

Gly Lys Asp Asp Glu Scr Arg Leu Asp His Leu Gly Val Val Ala Tyr 
195 200 205 

Asn Arg Lys Gin Arg Scr lie Pro Leu Scr Pro lie Val Pro Glu Scr 
2 10 2 15 2 20 

Scr Asp Pro Ala Ala Leu Lys Arg Ala Arg Asn Thr Glu Ala Ala Arg 
225 230 235 240 

Arg Scr Arg Ala Arg Lys Leu Gin Arg Mcl Lys Gin Leu Glu Asp Lys 
245 250 255 

Val Glu Glu Leu Leu Ser Lys Asn Tyr His Leu Glu Asn Glu Val Ala 
260 265 270 

Arg Leu Lys Lys Leu Val Gly Glu Arg 
275 280 

( 2 ) INFORMATION FOR SEQ ID NO:5: 

( i ) SEQUENCE CHARACTERISTICS: 

( A ) LENGTH: 1824 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: double 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (genomic) 

( i z ) FEATURE: 

( A ) NAME/KEY: CDS 
( B ) LOCATION: 778.. 1620 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO 

ATCTTCGGGO ATATAAAGTG CATGAGCATA CATCTTGA A A AAAAAAGATG AAAAATTTCC 60 

GACTTTAAAT A COG A A G A T A AATACTCCAA CCTTTT TTTC C AATTCCG A A ATTTTAGTCT 120 

TCTTT AAAG A AGTTTCGGCT CGCTGTCTTA CCTTTTAAAA TCTTCTACTT CTTG AC AGT A 180 

CTTATCTTCT TATATAATAG ATATACAAAA CAAAACAAAA C AA AAACTC A CAACACAOGT 240 

TACTCTCCCC CCTAAATTCA AATTTTTTTT GCCCATCAGT TTC ACT AGCG A ATT AT AC A A 300 

CTCACCAGCC ACACAGCTCA CTC ATCTACT TCGCAATCAA AACAAAATAT TTTATTTTAG 360 

TTCAGTTTAT TAAGTTATTA TCAGTATCGT ATTAAAAAAT TAAAGATCAT TGAAAAATGG 420 

C TTGC T A A AC CGATTATATT TTGTTTTTAA AGT AG AT TAT T ATT AG AAA A TTATTAAGAG 480 

AATTATOTGT TAAATTTATT GAAAGAGAAA ATTT ATTTTC CCTTATTAAT TAAAGTCCTT 540 

TACTTTTTTT GAAAACTGTC AGTTTTTTGA AGAGTTATTT GTTTTGTT AC CAATTGCTAT 600 

CATGTACCCG TAGAATTTTA TTC A AG A TG T TTCCGT A ACG GTTACCTTTC TGTC A A ATT A 660 

TCC AGGTTT A CTCGCCAATA AAAATTTCCC TAT ACT ATC A T T A ATT A A A T CATTATTATT 720 

ACTAAAGTTT TGTTTACCAA T TTGTC TGCT CAAGAAAATA A ATT AA AT A C A A A T A A A 777 
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ATG TCC OA A TAT CAG CCA AOT TTA TTT OCT TTA A A T CCA ATG GOT TTC 

Met Scr Glu Tyr Gin Pro Set Leu Phc Ala Leu Ain Pro Me i Gly Phc 

15 10 15 

TCA CCA TTG GAT GGT TCT AAA TCA ACC AAC OA A A A T GTA TCT GCT TCC 

Scr Pro Leu Asp Gly Scr Lyi Scr Thr Asn Glu Ain Val Scr Ala Scr 

2 0 2 5 3 0 

ACT TCT ACT GCC AAA CCA ATG OTT GGC CAA TTG ATT TTT GAT AAA TTC 

Thr Scr Thr Ala Lys Pro Met Val Gly Gin Leu lie Phc Aip Lys Phc 

3 5 4 0 4 5 

ATC A AG ACT G A A GAG GAT CCA ATT ATC AAA CAG GAT ACC CCT TCG AAC 

1 i o Lys Thr Glu Glu Asp Pro lie lie Lys Gin Alp Thr Pro Scr Asn 

5 0 5 5 60 

CTT GAT TTT GAT TTT GCT CTT CCA CAA ACG GCA ACT OCA CCT GAT GCC 

Leu Asp Phc Asp Phc Ala Leu Pro Gin Thr Ala Thr Ala Pro Asp Ala 

65 7 0 7 5 80 

A AG ACC GTT TTG CCA ATT CCG GAG CTA GAT GAC GCT GTA GTG OA A TCT 

Lys Thr Val Leu Pro lie Pro Glu Leu Asp Asp Ala Val Val Glu Scr 

8 5 9 0 » 9 5 

TTC TTT TCG TCA AGC ACT GAT TCA ACT CCA ATG TTT GAG TAT OA A AAC 

Phc Phc Scr Scr Scr Thr Asp Scr Thr Pro Met Phc Glu Tyr Glu Asn 

10 0 10 5 110 

CTA G A A GAC AAC TCT AAA G A A TGG ACA TCC TTG TTT GAC A AT GAC ATT 

Leu Glu Asp Asn Scr Lys Glu Trp Thr Scr Leu Phc Asp Asn Asp lie 

115 12 0 12 5 

CCA GTT ACC ACT, GAC GAT GTT TCA TTG GCT GAT A AG GCA ATT GAA TCC 

Pro Val Thr Thr Asp Asp Val Scr Leu Ala Asp Lys Ala lie Glu Scr 

13 0 13 5 14 0 

ACT GAA GAA GTT TCT CTG GTA CCA TCC A A T CTG GAA GTC TCG ACA ACT 

Thr Glu Glu Val Scr Leu Val Pro Scr Asu Leu Glu Val Scr Thr Thr 

145 150 155 160 

TCA TTC TTA CCC ACT CCT GTT CTA GAA GAT GCT AAA CTG ACT CAA ACA 

Scr Phc Leu Pro Thr Pro Val Leu Glu Aip Ala Lys Leu Thr Gin Thr 

16 5 17 0 17 5 

AG A A AG GTT A AG AAA CCA A A T TCA GTC GTT A A G A A G TCA CAT CAT GTT 

Arg Lys Val Lys Lys Pro Asn Scr V s 1 Val Lys Lys Scr His His Val 

18 0 18 5 19 0 

GGA A AG GAT GAC GAA TCG AGA CTG GAT CAT CTA GGT GTT GTT GCT TAC 

Gly Lys Asp Asp Glu Scr Arg Leu Asp His Leu Gly Val Val Ala Tyr 

195 200 205 

AAC CGC AAA CAG CGT TCG ATT CCA CTT TCT CCA ATT GTG CCC GAA TCC 

Asn Arg Lys Gin Arg Scr lie Pro Leu Scr Pro lie Val Pro Glu Scr 

2 10 2 15 220 

AG T GAT CCT GCT GCT CTA AAA CGT GCT AGA AAC ACT GAA GCC GCC AGG 

Scr Asp Pro Ala Ala Leu Lys Arg Ala Arg Asn Thr Glu Ala Ala Arg 

225 230 . 235 240 

CGT TCT CGT GCG AGA A AG TTG CAA CGT CTG AAA CAG ATC GAA GAC A AG 

Arg Scr Arg Ala Arg Lys Leu Gin Arg Leu Lys Gin lie Glu Asp Lys 

245 250 255 

TTA GAA GAA ATC CTT TCG A AG CTC TAT CAC ATC GAG A AT GAG TTA GCG 

Leu Glu Glu 11c Leu Ser Lys Leu Tyr His 11c Glu Asn Glu Leu Ala 

260 265 270 

CGC ATC AAA AAA TTA TTG GGT GAA CGT TGATTTCATT TACCTTTTAT 

Arg lie Lys Lys Leu Leu Gly Glu Arg 

275 280 

TTTATATTTT TTATTTCATT CTCGTGTATA ACGAAATAGA TACATTCACT TAGATAAGAA 

TTT AATCTTT TTTATGCCAA TTTTCTT A AO TAGAATTTTA CACCACGCAT TTATAATCTG 

CCGT ATGTTC TGGTATTTAC TGOTTAGGAA T AGATA A A A A A AAC ACTC AC GATGGGGGTC 

G A AC 



8 2 5 



8 7 3 



9 2 1 



9 6 9 



10 17 



10 6 5 



1113 



1 6 1 



12 0 9 



12 5 7 



13 0 5 



14 0 1 
14 4 9 

14 9 7 

15 4 5 

15 9 3 

16 4 0 

17 0 0 

17 6 0 

18 2 0 
18 2 4 
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( 2 } INFORMATION FOR SEQ ID NO:6: 

( i ) SEQUENCE CHARACTERISTICS: 

( A ) LENGTH: 281 amino adds 
( B ) TYPE: amino acid 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: protein 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO;6: 

Mcl Scr Glu Tyr Gin Pro Scr Leu P h c Ala Leu Asa Pro Mcl Gly Pbc 
1 5 10 15 

Scr Pro Leu Asp Gly Scr Lys Scr Thr An Glu Asn Vat Scr Ala Scr 
2 0 2 5 3 0 

Thr Scr Thr Ala Lys Pro Met Val Gly Gin Leu lie Pbc Asp Lys Phc 
3 5 4 0 4 5 

lie Ly i Thr Glu Glu Asp Pro lie lie Lys Gin Asp Thr Pro Scr Asn 
5 0 5 5 6 0 

Leu Asp Phc Asp Phc Ala Leu Pro Gin Thr Ala Thr Ala Pro Asp Ala 
65 70 75 80 

Lys Thr Val Leu Pro lie Pro Glu Leu Asp Asp Ala Val Val Glu Ser 
8 5 9 0 9 5 

Phc Phc Scr Scr Scr Thr Asp Scr Thr Pro Mcl Phc Glu Tyr Glu Asn 
10 0 10 5 110 

Leu Glu Asp Asn Scr Lys Glu Trp Thr Scr Leu Phc Asp Asn Asp lie 
115 12 0 12 5 

Pro Val Thr Thr Asp Asp Val Scr Leu Ala Aap Lys Ala lie Glu Scr 
13 0 13 5 14 0 

Thr 01 u Glu Val Scr Leu Val Pro Scr Asn Leu Glu Val Scr Thr Thr 
145 150 155 160 

Scr Phc Leu Pro Thr Pro Val Leu Glu Asp Ala Lys Leu Thr Gin Thr 
16 5 17 0 17 5 

Arg Lys Val Lys Lys Pro Asn Scr Val Val Lys Lys Scr His Hit Val 
180 185 190 

Gly Lys Asp Asp Glu Scr Arg Leu Asp His Leu Gly Val Val Ala Tyr 
195 200 205 

Asn Arg Lys Gin Arg Scr lie Pro Leu Scr Pro lie Val Pro Glu Scr 
2 10 2 15 2 2 0 

Scr Asp Pro Ala Ala Leu Lys Arg Ala Arg Asn Thr Glu Ala Ala Arg 
225 230 235 240 

Arg Scr Arg Ala Arg Lys Leu Gin Arg Leu Lys Gin lie Glu Asp Lys 
245 250 255 

Leu Glu Glu Me Leu Scr Lys Leu Tyr His lie Glu Asn Glu Leu Ala 
260 265 270 

Arg lie Lys Lys Leu Leu Gly Glu Arg 
2 7 5 2 8 0 

( 2 ) INFORMATION FOR SEQ ID NO:7: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 3 amino acids 
( B ) TYPE: amino add 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: peptide 

( x I ) SEQUENCE DESCRIPTION: SEQ ID NO:7: 



Arg Gly Asn 
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( 2 ) INFORMATION FOR SEQ ID NO:8: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 6 amino acids 
( B ) TYPE: amino acid 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: peptide 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO*: 

Arg Gly Gly Asn Pro Gl u 
1 5 



( 2 ) INFORMATION FOR SEQ ID NO:9: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 5 unino acid] 
( B ) TYPE: amino acid 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: peptide 

( x 1 ) SEQUENCE DESCRIPTION: SEQ ID NO& 

Gly Gly Asn Gin Alt 
1 5 



( 2 ) INFORMATION FOR SEQ ID NO:10: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 20 base pain 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: double 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DN A (genomic) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:l(h 

A AC ATGTCC C A A C ATGTTG 



( 2 ) INFORMATION FOR SEQ ID N0:11: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 58 base pain 
( B ) TYPE: nucleic add 
( C ) STRANDEDNESS: double 
( D ) TOPOLOGY: linear 

( i i )MOI£Ctn^TYPE:DNA(|raoinic) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NOUl: 

OGTC A AGTTG GGACACGTCC GGCGTCGGCT GTCGG AGG AG CTAAGTCCTG ACATGTCT 



( 2 ) INFORMATION FOR SEQ ID NO:12: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 234 base pain 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: double 
( D ) TOPOLOGY: linear 

( i i )MOLECUIJBTYPE:DNA(scfloimc) 

( i x ) FEATURE: 

( A ) NAME/KEY: CDS 
( B ) LOCATION: 1-234 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NCH12: 



GCC CCC CCG ACC GAT GTC AGC CTG GGG G AC O AG CTC CAC TTA G AC GGC 



47 



5,573,925 
-continued 



48 



Ala Pro Pro Th r Asp Val Scr Leu Oly Asp Olu Leu His Leu Asp Gly 
1 5 10 15 

GAG G AC GTG GCG A T G GCG CAT GCC GAC GCG CTA G AC GAT TTC GAT CTG 96 
Glu Asp Val Ala Met Ala His Ala Asp Ala Leu Asp Asp Pbc Asp Leu 
2 0 2 5 3 0 

GAC AT G TTG GGG GAC GGG GAT TCC CCG GGG CCG GGA TTT ACC CCC CAC 144 
Asp Mel Leu Gly Asp Gly Asp Scr Pro Gly Pro Gly Phc Tbr Pro His 
3 5 4 0 4 5 

GAC TCC GCC CCC TAC GGC GCT CTG GAT ATG GCC GAC TTC GAG TTT GAG 192 
Asp Scr Ala Pro Tyr Gly Ala Leu Asp Mcl Ala Asp Phc Glu Phc Glu 
SO 5 5 6 0 

CAG ATG TTT ACC GAT GCC CTT GGA ATT GAC GAG TAC GGT GGG 234 
Gin Mcl Phc Tor Asp Ala Leu Gly lie Asp Glu Tyr Gly Gly 
6 5 7 0 7 5 

( 2 ) INFORMATION FOR SEQ ID NO: 13: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 78 amino acids 
( B ) TYPE: amino acid 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: protein 

( x i ) SEQUENCE DESCRIPTION: . SEQ ID NO: 13: 

Ala Pro Pro Th r Asp Val Scr Leu Gly Asp Glu Leu His Leo Asp Gly 
1 5 10 15 

Glu Asp Vtl Ala Met Ala His Ala Asp Ala Leu Asp Asp Pbc Asp Leu 
2 0 2 5 3 0 

Asp Mcl Leu Gly Asp Gly Asp Scr Pro Gly Pro Gly Phc Tbr Pro His 
3 5 4 0 4 5 

Asp Scr Ala Pro Tyr Gly Ala Leu Asp Mcl Ala Asp Phc Glu Pbc Glu 
5 0 5 3 6 0 

Gin Met Phc Tbr Asp Ala Leu Gly lie Asp Glu Tyr Gly Gly 
6 5 7 0 7 5 



( 2 ) INFORMATION FOR SEQ ID NO:14: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 7 amino acids 
( B ) TYPE: amino acid 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: peptide 

( x 1 ) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Pro Lys Lys Lys Arg Lys Val 
1 5 



( 2 ) INFORMATION FOR SEQ ID NO: 15: 

( i ) SEQUENCE CHARACTERISTICS: 

( A ) LENGTH: 390 amino acids 
( B ) TYPE: amino acid 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: protein 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:15: 

Mcl Tbr Ala Mcl Glu Glu Scr Gin Scr Asp lie Scr Leu Glu Leu Pro 
1 5 10 15 

Leu Scr Gin Glu Thr Phc Scr Gly Leu Trp Lys Leu Leu Pro Pro Glu 
2 0 2 5 3 0 

Asp lie Leu Pro Scr Pro His Cys Mcl Asp Asp Leu Leu Leu Pro Gin 
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3 5 40 4 5 

Asp Val Glu Ota Phc Phc Ola Gly Pro Scr Glo Ala Leu Arg Vtl Scr 
5 0 5 5 6 0 

Gly Ala Pro Ala Ala Gin Asp Pro Val Thr Glu Thr Pro Gly Pro Val 
65 70 75 80 

Ala Pro Ala Pro Ala Thr Pro Trp Pro Leu Scr Scr Phc Val Pro Scr 
8 5 9 0 °5 

Gin Lys Thr Tyr Gin Gly Asa Tyr Gly Phc Hi J Leu Gly Phc Leu Gin 
100 10 5 110 

Scr Gly Thr Ala Lyi Ser Val Met Cys Thr Tyr Scr Pro Pro Leu Asn 
115 12 0 12 5 

Lys Leu Phc Cys Gin Leu Val Lys Thr Cys Pro Val Gin Leu Trp Val 
13 0 13 5 14 0 



r Ala Thr Pro Pro Ala Oly Scr Arg Val Arg Ala Mcl Ala lie Tyr 
15 0 15 5 16 0 



S c 
1 4 5 

Lvs Lya Scr Gin His Met Thr Glu Val Val Arg Arg Cya Pro His His 
' 165 170 175 

Glu Arg Cys Ser Asp Oly Asp Gly Leu Ala Pro Pro Gin His Leu lie 
18 0 18 5 190 

Arg Val Olu Gly Asn Lou Tyr Pro Glu Tyr Leu Glu Asp Arg Gin Thr 
195 200 205 

Phc Arg His Scr Val Val Val Pro Tyr Glu Pro Pro Olu Ala Gly Scr 
2 10 2 15 220 

Glu Tyr Thr Thr lie His Tyr Lys Tyr Mcl Cys Asn Scr Scr Cys Mcl 
225 230 235 240 



Gly Gly Met Asn Arg Arg Pro lie Leu Thr 



lc lie Thr Leu Glu Asp 



24 5 



250 255 



! c r Scr 



Gly Asn Leu Leu Gly Arg Asp Scr Phc Glu Val Arg Val Cyi 
260 265 270 



Ala Cys Pro Gly Arg Asp Arg Arg Thr Glu Olu Glu Asn Phc Arg Lys 

275 280 285 

Lys Glu Val Leu Cys Pro Glu Leu Pro Pro Gly Scr Ala Lys Arg Ala 

290 295 300 

Leu Pro Thr Cys Thr Scr Ala Ser Pro Pro Gin Lys Lys Lys Pro Leu 

305 310 315 320 

Asp Gly Glu Tyr Phc Thr Leu Lys lie Arg Gly Arg Lys Arg Phc Glo 

325 330 335 

Mcl Phc' Arg Glu Leu Asn Glu Ala Leu Glu Leu Lys Asp Ala His Ala 



3 4 0 



345 350 



Thr Glu Glu Scr Gly Asp Scr Arg Ala His Scr Scr Tyr Leu Lys Thr 

3 5 5 3 60 3 6 5 

Lys Lys Gly Gin Scr Thr Scr Arg His Lys Lys Thr Mcl Val Lys Lys 

370 375 380 

Val Gly Pro Asp Sor Asp 

38 5 3 90 



( 2 ) INFORMATION FOR SEQ ID NO:16: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 26 baso pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: double 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (genomic) 



( % i ) SEQUENCE DESCRIPTION: SEQ ID NO:16: 
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CCCGAACATG TCCCAACATG TTGGGG 

( % ) INFORMATION FOR SEQ ID NO:17: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 30 base pairs 
( B ) TYPE* nucleic acid 
( C ) STRANDED NESS: double 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (genomic) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:17: 

TCGAGCATGT TCGAGCATGT TCGAGC ATGT 



2 6 



3 0 



( 2 ) INFORMATION FOR SEQ ID NO:18: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 30 base pain 
( B ) TYPE: nucleic acid 
( C ) STRANDED NESS: double 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (genomic) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:18: 

CCGGGCATGT CCGGGC ATGT CCGGGCATGT 



30 



( 2 ) INFORMATION FOR SEQ ID NO: 19: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 47 base pain 
( B ) TYPE: nucleic acid 
( C ) STRANDED NESS: double 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (genomic) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

T A TGGGC ATG TCCT AT AT AT ATGCGTATAT AT ATGGGC AT GTCCTAT 



( 2 ) INFORMATION FOR SEQ ID NO:20: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 1215 base pain 
( B ) TYPE: nucleic acid 
( C ) STRAND EDNESS: double 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (genomic) 

{ x i ) SEQUENCE DESCRIPTION: SEQ ID N020: 



GAATTCAACC 


AGCAGCCTCC 


CGCGACCATG 


GAGGAGCCGC 


AGTCAGATCC 


T AGCGTC G AG 


60 


CCCCCTCTGA 


GTCAGGAAAC 


ATTTTC AG AC 


CTATGGAAAC 


TACTTCCTGA 


AAACAACGTT 


1 2 0 


CTGTCCCCCT 


TGCCGTCCCA 


AGCAATGG AT 


G ATT T G AT GC 


TGTCCCCCGA 


CG AT ATTGA A 


1 80 


C A A TGGTTC A 


CTGAAGACCC 


AGGTCC AG AT 


G AAGCTCC C A 


GAATGCC AGA 


GGCTGCTCCC 


2 4 0 


CCCGTGGCCC 


CTGC ACCAGC 


AGCTCCT AC A 


CCGGCCGCCC 


CTGC ACC AQC 


CCCCTCCTGG 


3 00 


CCCCTGTCAT 


CTTCTGTCCC 


TTCCCAGAA A 


ACCTACCAGG 


GCAGCTACGG 


TTTCCGTCTG 


3 6 0 


GGCTTCTTGC 


ATTCTGGG AC 


AGCCAAGTCT 


GTGAC TTGC A 


CGTACTCCCC 


TGCCCTCAAC 


4 2 0 


AAGATGTTTT 


GCC A A CTGGC 


GAAGACCTGC 


CCTGTGC AGC 


TGTGGGTTGA 


TTCCACACCC 


4 8 0 


CCGCCCGGCA 


CCCGCGTCCG 


CGCCATGGCC 


ATCTACAAGC 


AGTC ACAGCA 


C ATG ACGGAG 


5 4 0 


GTTGTGAGGC 


GCTGCCCCCA 


CC ATGAGCGC 


TOCTCAGATA 


GCGATGGTCT 


GGCCCCTCCT 


6 0 0 
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CAOCATCTTA 


TCCG AGTGGA 


AGGAAATTTG 


CGTGTGO AGT 


ATTTGG A TG A 


CAGAAAC ACT 


6 6 0 


TTTCGAC ATA 


GTGTGGTGGT 


ACCCT ATG AO 


CCGCCTGAGG 


TTGGCTCTGA 


CTGTACC ACC 


7 2 0 


ATCC ACTACA 


ACTA C ATGTG 


T A AC AOTTCC 


TGC ATGGGCG 


GCATG A ACCG 


GAGGCCC ATC 


7 8 0 


CTC ACC AT C A 


TC AC ACTGGA 


AG ACTCC AGT 


GGTAATCTAC 


TGGG ACGG A A 


CAGCTTTGAG 


8 40 


GTOCGTQTTT 


GTGCCTGTCC 


TGGG AGAGAC 


CGGCGC ACAO 


AGG A AG AG A A 


TCTCCGC AAG 


9 0 0 


AA AOGOGAGC 


CTCACC ACGA 


GCTCCCCCCA 


GGG AGC ACT A 


AGCG AG C ACT 


GCCCA ACAAC 


9 6 0 


ACCAGCTCCT 


CTCCCC AGCC 


AA AOAAG AAA 


CC ACTGGATG 


GAG A AT A T TT 


CACCCTTCAG 


10 2 0 


ATCCGCGGGC 


GTGAGCGCTT 


CO AAATGTTC 


CGAGAGCTGA 


ATGAGGCCTT 


GGAACTC AAG 


10 8 0 


GATGCCCAGG 


CT GGG A AGO A 


GCC AGGGGGG 


AG C AGGGC TC 


ACTCC AGCCA 


CCTGA AGTCC 


114 0 


A A A A AGOG TC 


AGTCTACCTC 


CCGCCAT A AA 


A A AC TC ATGT 


TC A AG A C AG A 


AGGGC CT G AC 


120 0 


TC AGACTGAG 


TCGAC 
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( 2 ) INFORMATION FOR SEQ ID N021: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 30 base pain 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (genomic) 

( x 1 ) SEQUENCE DESCRIPTION: SEQ ID NCK21: 

GC AG AGGAGC A A A AGC TTG A AGACAAGOTT 



( 2 ) INFORMATION FOR SEQ ID N022: 

( 1 ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 34 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (genomic) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:22: 

CTTT AATC A A CCOCTTGCGA CTCAGCTGGA CTTC 



( 2 ) INFORMATION FOR SEQ ID NO:23: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 32 base pairs 
( B ) TYPE: nucleic add 
( C ) STRANDEDNESS: single . 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (genomic) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID N023: 

ATOAGGCCTT GGAAGACAAG GTTG A AG AAT TG 



( 2 ) INFORMATION FOR SEQ ID NO:24: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 7 base pain 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (genomic) 



( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:24: 
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( 2 ) information for seq id no:25: 

c i ) sequence characteristics: 

( A ) LENGTH: 9 base pain 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (genomic) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:25: 

GACGCCCGC 9 

( 2 ) INFORMATION FOR SEQ ID NO:26: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 26 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (genomic) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:26: 

COAAATGTTC CGAGAGCGAA TGAAAC 26 

( 2 ) INFORMATION FOR SEQ ID NO:27: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 24 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (genomic) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:27: 

GTTTCATTCG CTCTCGGAAC ATTT 24 

( 2 ) INFORMATION FOR SEQ ID NO:28: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 73 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (genomic) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:2S: 
TATCCGCGGT A ATCGTCTG A AACAGATCGA AGACAAGTTA GAAGAAATCC TTTCGAAGCT 60 
CTATCACATC GAG 73 

( 2 ) INFORMATION FOR SEQ ID NO:29: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 71 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( t i ) MOLECULE TYPE: DNA (genomic) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:29: 

GCTTCGAGAT AGTGTAGCTC TTACTCAATC GCGCGT AGTT TTTTAATAAC CCACTTGCAA 60 
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CTC AGCTGTT T 



( 2 ) INFORMATION FOR SEQ ID NO-JO: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 30 base pain 
( B ) TYPE nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (genomic) 

( x 1 } SEQUENCE DESCRIPTION: SEQ ID N030: 

TTCTCCGCGG AGTGGTTTCT TCTTTOGCTG 



( 2 ) INFORMATION FOR SEQ ID NO 31: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 43 base pain 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (genomic) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID N031: 

TATCCGCGGT GGAAATCCTG AACTGAAACA GATCGAAGAC A AG 



( 2 ) INFORMATION FOR SEQ ID NO:32: 

( i ) SEQUENCE CHARACTERISTICS: 
< A ) LENGTH: 4 amino acids 
( B ) TYPE: amino acid 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: peptide 

( x 1 ) SEQUENCE DESCRIPTION: SEQ ID N032: 

Gly Aan Pro Gin 



( 2 ) INFORMATION FOR SEQ ID N033: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 30 base pain 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (genomic) 

(x!) SEQUENCE DESCRIPTION: SEQ ID N033: 

TCATCGATG CGTTCGCCAA CTAATTTCTT 



( 2 ) INFORMATION FOR SEQ ID NO:34: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 35 base pain 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (genomic) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:34: 

ATG AGGCCTT GGAACTCATC ATGCCCAGG CTGGG 



( 2 ) INFORMATION FOR SEQ ID NO:35: 
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( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 40 base pain 
( B ) TYPE: nucleic acid 
( C ) STRAND EDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (genomic) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID N035: 

TTTGTCGACT CAATCGATAC GTTCACCCAA T A A TTTTTTO 



4 0 



( 2 ) INFORMATION FOR SEQ ID NO:36: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 27 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDED NESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (genomic) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NOi36: 

GAGAGCCCC A GTTACCATAA CTACTCT 



2 7 



( 2 ) INFORMATION FOR SEQ ID N037: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 30 base pairs 
( B )TYPE: nucleic acid 
( C ) STRANDEDNESS: doubk 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: DNA (genomic) 

( x i ) SEQUENCE DESCRIPTION: SEQ ID N037: 

ATC ACGTG AT ATCACGTGAT ATCACGTGAT 



3 0 



What is claimed is: 

1. A nucleotide sequence encoding a modified p53 pro- 
tein, said modified p53 protein comprising: 

a p53 sequence consisting of amino acid residues 1 to 
between about 290 to about 334 of human p53 and a 
heterologous tetramerization domain consisting of a 
modified leucine zipper sequence containing isoleu- 
cines and positions d of the coiled coil and leucines at 
positions a, 

said modifications providing said protein with the inabil- 
ity to hetero-oligomerize with wild-type p53 or tumor- 
derived mutant p53 proteins, and said modifications not 
interfering with the native tumor-suppression function 
of the protein or with its ability to form tetramers. 50 

2. The nucleotide sequence according to claim 1, said 
sequence under the control of regulatory sequences directing 
its replication and in vivo or in vitro expression of said 
protein in a selected host cell. 

3. A nucleotide sequence encoding a chimeric p53 protein 55 
comprising: 

an N-terminal p53 sequence encoding amino acid residue 
I to at least residue 290 and at most residue 334; 

a heterologous tetramerization sequence consisting of a ^ 
modified leucine zipper sequence containing isoleu- 
cines at positions d of the coiled coil and leucines at 
positions a; and 

a C-terminaJ p53 fragment spanning amino acids about 
352 to 393, 65 

said modifications providing said protein with the inabil- 
ity to hetero-oligomerize with wild-type p53 or tumor- 



derived mutant p53 proteins and said modifications not 
interfering with the native tumor-suppression function 
of p53 or with its ability to form tetramers. 

4. The nucleotide sequence according to claim 1 or claim 
3, encoding a protein having the DNA binding and tran- 
scriptional functions characteristic of wild-type p53. 

5. The nucleotide sequence according to claim 1 wherein 
said N-terminal p53 sequence is derived from the full-length 
sequence by deletion of the residues selected from the group 
consisting of amino acids 301-393, amino acids 324-393, 
amino acids 326-393 and amino acids 335-393 of SEQ ID 
NO: 2. 

6. A nucleotide sequence encoding a modified p53 protein 
selected from the group consisting of: 

(a) a p53 sequence spanning, from N-terminus to C-ter- 
minus, residues 1-334 of SEQ ID NO: 2, an Asn linker, 
and a heterologous sequence spanning residues 
249-281 of SEQ ID NO: 6 containing isoleucines at 
positions d of the coiled coil and leucines at positions 
a; 

(b) a p53 sequence spanning, from N-terminus to C-ter- 
minus, residues 1-334 of SEQ ID NO: 2, a Gly-Asn- 
Pro-Glu SEQ ID NO: 32 linker, and a heterologous 
sequence spanning residues 250-281 of SEQ ID NO: 6 
containing is leucines at positi ns d of the coiled coil 
and leucines at positions a; 

(c) a p53 sequence spanning, from N-terminus to C-ter- 
minus, residues 1-325 of SEQ ID NO: 2, an Arg-Gly- 
Asn SEQ ID NO: 7 linker, and the heterologous 
sequence of (a) above; 
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(d) a p53 sequence spanning, from N-terminus to C-ter- 
minus, residues 1-325 of SEQ ID NO: 2, an Arg-Gly- 
Gly-Asn-Pro-Glu SEQ ID NO: 8 linker, and the heter- 

logous sequence of (b) above; 

(e) a p53 sequence spanning, from N-terminus to C-ter- 5 
minus, residues 1-323 of SEQ ID NO: 2, an Arg-Gly- 
Asn SEQ ID NO: 7 linker, and the heterologous 
sequence of (a) above; 

(f) a p53 sequence spanning, from N-terminus to C-ter- 
minus, residues 1-323 of SEQ ID NO: 2, an Arg-Gly- 
Gly-Asn-Pro-Glu SEQ ID NO: 8 linker, and the heter- 
ologous sequence of (b) above; and 

(g) a p53 sequence spanning, from N-terminus to C-ter- 
minus, residues 1-300 of SEQ ID NO: 2, a Gly-Gly- 15 
Asn-Gln-Ala SEQ ID NO: 9 linker, and the heterolo- 
gous sequence of (b) above. 

7. A nucleotide sequence encoding a modified p53 protein 
having restricted DNA binding specificity which comprises 

a human p53 sequence and a selected heterologous tetramer- ^ 
ization domain, wherein said p53 sequence contains a dele- 
tion selected from the group consisting of a single deletion 
of amino acids 300-327 of SEQ ID NO: 2 and a double 
deletion of amino acids 290-297 and 300-321 of SEQ ID 
NO: 2, wherein said deletion disrupts the native p53 tet- ^ 
ramerization domain. 

8. A nucleotide sequence encoding a modified p53 protein 
comprising amino acid residues 1 to about 290 of human p53 
and a heterologous tetramerization domain consisting of a 
leucine zipper which has been modified to contain isoleu- ^ 
cines at positions d and leucines at positions a of the leucine 
zipper coil, said modified p53 protein characterized by the 
inability to hetero-oligomerize with wild-type p53 or tumor- 
derived mutant p53 proteins and not interfering with the 
native tumor-suppression function of the p53 protein or with 35 
its ability to form tetramers. 

9. A nucleotide sequence encoding a chimeric p53 protein . 
selected from the group consisting of: 

(a) a p53 sequence spanning, from N-terminus to C-ter- 
minus, residues 1-325 of SEQ ID NO: 2, an Arg-Gly- 40 
Asn SEQ ID NO: 7 linker, and a heterologous sequence 
spanning residues 249-281 of SEQ ID NO: 6 contain- 
ing, isoleucines at positions d of the coiled coil and 
leucines at positions a, an lie linker, and a p53 sequence 
spanning residues 352-393 of SEQ ID NO: 2; 45 

(b) a p53 sequence spanning, from N-terminus to C-ter- 
minus, residues 1-325 of SEQ ID NO: 2, an Arg-Gly- 
Gly-Asn-Pro-Glu SEQ ID NO: 8 linker, the heterolo- 
gous sequence spanning residues 250-281 of SEQ ID 
NO: 6 containing isoleucines at positions d of the 50 
coiled coil and leucines at positions a, an He linker, and 

a p53 sequence spanning residues 352-393 of SEQ ID 
NO: 2; 

(c) a p53 sequence spanning, from N-terminus to C-ter- 
minus, residues 1-323 of SEQ ID NO: 2, an Arg-Gly- 55 
Asn SEQ ID NO: 7 linker, the heterologous sequence 

of (a) above, an Be linker, and a p53 sequence spanning 
residues 352-393 of SEQ ID NO: 2; 
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(d) a p53 sequence spanning, from N-terminus to C-ter- 
minus, residues 1-323 of SEQ ID NO: 2, an Arg-Gly- 
Gly-Asn-Pro-Glu SEQ ID NO: 8 linker, the heterolo- 
gous sequence of (b) above, an Be linker, and a p53 
sequence spanning residues 352-393 of SEQ ID NO: 2; 
and 

(e) a p53 sequence spanning, from N-terminus to C-ter- 
minus, residues 1-334 of SEQ ID NO: 2, an Asn linker, 
the heterologous sequence of (a) above, an He linker, 
and a p53 sequence spanning residues 352-393 of SEQ 
ID NO: 2; 

(f) a p53 sequence spanning, from N-terminus to C-ter- 
rninus, residues 1-300 of SEQ ID NO:2, a Gly-Gly- 
Asn-Gln-Ala SEQ ID NO:9 linker, the heterologous 
sequence of (b) above, an Be linker, and a p53 sequence 
spanning residues 352-393 of SEQ ED NO:2; and 

(g) a p53 sequence spanning, from N-terminus to C-ter- 
minus, residues 1-334 of SEQ ID NO:2, a Gly-Asn- 
Pro-Glu SEQ ID NO:32 linker, a heterologous 
sequence of (b) above, an He linker, and a p53 sequence 
spanning residues 352-393 of SEQ ID NO:2. 

10. The nucleotide sequence according to claim 3 or 8, 
said sequence under the control of regulatory sequences 
directing its replication and in vivo or in vitro expression of 
said protein in a selected host cell. 

11. The nucleotide sequence according to any of claims 1 
or 8 encoding a protein wherein the leucine zipper sequence 
is derived from a protein selected from the group consisting 
of Jun, Max, GCN4, c-mvc, c-fos, and C/EBP. 

12. The nucleotide sequence according to claim 11 encod- 
ing a protein wherein the leucine zipper sequence is amino 
acid residues 249 to 281 of GCN4 (SEQ ID NO:6). 

13. The nucleotide sequence according to any of claims 1, 
3 or 8, encoding a protein further comprising an amino acid 
linker between said p53 sequence and said heterologous 
tetramerization domain. 

14. The nucleotide sequence according to claim 13 encod- 
ing a protein wherein said linker is selected from the group 
consisting of Glu, Asn, He, Gly-Asn, Arg-Gly-Asn SEQ ID 
NO: 7, Gly-Gly- Asn-Gln-Ala SEQ ID NO: 9, Arg-Gly-Gly- 
Asn-Pro-Glu SEQ ID NO: 8 and Gly-Asn-Pro-Glu SEQ ID 
NO: 32. 

15. A nucleotide sequence encoding a modified p53 
protein selected from the group consisting of: 

(a) a p53 sequence spanning, from N-terminus to C-ter- 
minus, residues 1-346 of human p53 SEQ ED NO:2, a 
Glu linker, and a heterologous tetramerization domain 
spanning residues 253-281 of GCN4 SEQ ID NO:4; 
and 

(b) a p53 sequence spanning, from N-terminus to C-ter- 
minus, residues 1-347 of human p53 SEQ ID NO:2 and 
a heterologous tetramerization domain spanning resi- 
dues 253-281 of GCN4 SEQ ID NO:4. 

* * * # * 



